- agents
- multimodal-ai
- ai-salon
- research
-
My talks at CVPR 2026 workshops
This post accompanies my talks at CVPR workshops. We explore ideas around ambient intelligence, visual task assistants, and efficiently scaling transformers.
-
Agentic Ambient Intelligence: Bringing AI into the Physical World
We explore the future of Agentic Ambient Intelligence and how advances in perception, reasoning, and motion-guided control are bridging the gap between digital AI capabilities and real-world physical assistance.
-
AI Agents: From Language to Multimodal Reasoning
An outline of our recent journey with AI Agents.
-
Level up your Agents: Teaching Vision-Language Models to Play by the Rules
We explore how Vision-Language Models can be improved for interactive decision-making by using our new reinforcement learning technique called Advantage-Filtered Supervised Fine-Tuning.
-
Will AI Empower or Eclipse Human Creativity? An Emerging Question in the Age of Intelligent Machines
A debate on whether AI will empower or eclipse human creativity, weighing its potential as a powerful tool against the risk that over-reliance diminishes our innate abilities.