blog | Juan Carlos Niebles

AI Agents: From Language to Multimodal Reasoning

An outline of our recent journey on AI Agents

3 min read · October 20, 2025

2025 · agents multimodal-ai · research
Level up your Agents: Teaching Vision-Language Models to Play by the Rules

We explore how Vision-Language Models can be improved for interactive decision-making by using our new reinforcement learning technique called Advantage-Filtered Supervised Fine-Tuning.

9 min read · June 04, 2025

2025 · multimodal-ai agents · research
Will AI Empower or Eclipse Human Creativity? An Emerging Question in the Age of Intelligent Machines

A debate of whether AI will empower or eclipse human creativity, considering its potential as a powerful tool versus the risk of over-reliance diminishing our innate abilities.

3 min read · May 07, 2025

2025 · ai-salon
The Generalization Gambit - Does It Still Matter in the Age of Big AI?

Big AI models are now trained on tons of data, is "generalization" (performing on new data) still important? Is generalization an outdated concept or crucial for real-world reliability?

4 min read · April 22, 2025

2025 · ai-salon
Are your Visual Programs Right for the Wrong Reasons?

ViUniT, Making AI visual reasoning reliable, one unit test at a time.

5 min read · April 17, 2025

2025 · multimodal-ai · research

site stats