CarlosGG's Knowledge Garden 🪴

Search

Search IconIcon to open search

VLMs

Last updated Nov 14, 2024 Edit Source

Vision language models are models that can learn simultaneously from images and texts to tackle many tasks, from visual question answering to image captioning

# Resources

# Code

# References