AI Trends of May 2023 You Need To Know by Gonzalo Recio Medium
HuggingChat
The first open source alternative to… ChatGPT.
huggingface.co
HuggingChat screenshot.
Transformers Agent
Transformers Agent is an experimental API… which is subject to change at any time. Results returned by the agents can…
huggingface.co
Figure (Transformers Agent diagram):
Instruction: "Read out loud the content of the image"
Prompt: "I will ask you to perform a task, your job is to come up with a series of simple commands in Python that will perform the task. You can print intermediate results if it makes sense to do so.
Tools:
• image_generator: This is a tool that generates an image
• image_captioner: This is a tool that captions an image
• text_to_speech: Converts the text to audio
<Examples of tasks>
Task: "Read out loud the content of the image""
Agent: "I will use the image_captioner to caption the image and the text_to_speech to read it out loud."
Generation:
caption = image_captioner(image)
audio = text_to_speech(caption)
Interpreter: Restricted Python interpreter
Result: "A river flowing through a frozen forest"
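The flow in the diagram (prompt with tool descriptions, generated Python, restricted execution) can be sketched roughly as follows. This is only an illustration under loud assumptions: the two tools are stubs that return fixed strings, `fake_llm` is a hypothetical stand-in for the language model, and emptying `__builtins__` is a crude placeholder for a restricted interpreter, not a real sandbox. None of these names come from the Hugging Face API.

```python
# Hypothetical stand-ins for the tools named in the diagram. They are NOT the
# Hugging Face implementations and only return fixed strings.
def image_captioner(image):
    """Pretend to caption an image (stub)."""
    return "A river flowing through a frozen forest"


def text_to_speech(text):
    """Pretend to synthesize speech; returns a tagged string instead of audio (stub)."""
    return f"<audio: {text}>"


# Tool registry: name -> (callable, description shown to the model).
TOOLS = {
    "image_captioner": (image_captioner, "This is a tool that captions an image"),
    "text_to_speech": (text_to_speech, "Converts the text to audio"),
}


def build_prompt(task):
    """Assemble the prompt: instructions, tool descriptions, then the task."""
    tool_lines = "\n".join(f"- {name}: {desc}" for name, (_, desc) in TOOLS.items())
    return (
        "I will ask you to perform a task, your job is to come up with a series "
        "of simple commands in Python that will perform the task.\n"
        f"Tools:\n{tool_lines}\n"
        f'Task: "{task}"'
    )


def fake_llm(prompt):
    """Stand-in for the language model (assumption): returns the code from the diagram."""
    return "caption = image_captioner(image)\naudio = text_to_speech(caption)"


def run_generated_code(code, inputs):
    """Execute generated code with access to the tools and the given inputs only.

    Emptying __builtins__ is a crude illustration, not a real sandbox.
    """
    namespace = {"__builtins__": {}}
    namespace.update({name: fn for name, (fn, _) in TOOLS.items()})
    namespace.update(inputs)
    exec(code, namespace)
    return namespace


prompt = build_prompt("Read out loud the content of the image")
code = fake_llm(prompt)
result = run_generated_code(code, {"image": "river.png"})
print(result["caption"])
print(result["audio"])
```

Keeping the tools in a single registry means the same descriptions feed both the prompt and the execution namespace, so the model can only call what it was told about.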
Source: https://github.com/kagisearch/pyllms
ChatGPT Prompt Engineering for… Developers
In ChatGPT Prompt Engineering for… Developers, you will learn how to use a large language model (LLM) to quickly build…
www.deeplearning.ai
Bing Chat Update: Images, videos & plugins!
Bing Chat now includes images and videos
in replies and chat history, and in the near
future it will add multimodal support
and plugin usage, among many other
things. Take a look at the following link to
see what is to come:
Computer Vision
Midjourney 5.1
A new version of the model has been released, with
significant improvements in prompt
understanding and image sharpness,
as well as fewer borders and
unwanted text artifacts.
DeepFloyd IF
Stability AI releases DeepFloyd IF, a
powerful text-to-image model that can
smartly integrate text into images.
Incorporating the large language model
T5-XXL-1.1 as a text encoder, DeepFloyd IF
generates coherent and clear text
alongside objects of different properties.
Yann LeCun (@ylecun):
Paper: dl.fbaipublicfiles.com/imagebind/imag…
Demo: imagebind.metademolab.com
Code:…
ImageBind: https://arxiv.org/abs/2305.05665
YOLO-NAS
After the recent release of YOLOv8 by
Ultralytics, Deci.ai presents a new
Foundation Object Detection Model
providing Production-Ready performance.
It significantly improves inference
performance while preserving detection
accuracy.
pandas_ai.run(
    df,
    "Plot the histogram of countries showi…"
)
[Figure: bar chart titled "GDP by Country". x-axis: Country (United States, United Kingdom, Japan, Australia, Spain, Canada, Italy, Germany, China, France); y-axis: values in units of 1e13, from 0.00 to 2.00.]
https://github.com/gventuri/pandas-ai
[Figure panels captioned: "A chair that looks like an avocado", "An airplane that looks like a banana", "A spaceship", "A chair that looks like a tree", "A birthday cupcake", "A green boot".]
Mojo
Mojo is a new programming language
for all AI developers that combines the
usability of Python with the performance
of C, unlocking unparalleled
programmability of AI hardware and
extensibility of AI models.
Mindblowing research
AI can read minds?
Over the last few months, a new method based
on a diffusion model (DM) was proposed
to reconstruct images from human brain
activity obtained via functional magnetic
resonance imaging (fMRI). Recently,
researchers from the University of Texas at
Austin introduced a non-invasive decoder
that also reconstructs continuous language
from fMRI recordings. These findings
demonstrate the viability of brain–
computer interfaces to enhance human
interactions with machines.