Juan Carlos Niebles
  • about
  • news
  • blog (current)
  • datasets
  • publications
  • agents
  • •

  • multimodal-ai
  • •

  • ai-salon
  • •

  • research
  • Level up your Agents: Teaching Vision-Language Models to Play by the Rules

    We explore how Vision-Language Models can be improved for interactive decision-making by using our new reinforcement learning technique called Advantage-Filtered Supervised Fine-Tuning.

    9 min read   ·   June 04, 2025

    2025   ·   multimodal-ai   agents   ·   research

  • Will AI Empower or Eclipse Human Creativity? An Emerging Question in the Age of Intelligent Machines

    A debate of whether AI will empower or eclipse human creativity, considering its potential as a powerful tool versus the risk of over-reliance diminishing our innate abilities.

    3 min read   ·   May 07, 2025

    2025   ·   ai-salon

  • The Generalization Gambit - Does It Still Matter in the Age of Big AI?

    Big AI models are now trained on tons of data, is "generalization" (performing on new data) still important? Is generalization an outdated concept or crucial for real-world reliability?

    4 min read   ·   April 22, 2025

    2025   ·   ai-salon

  • Are your Visual Programs Right for the Wrong Reasons?

    ViUniT, Making AI visual reasoning reliable, one unit test at a time.

    5 min read   ·   April 17, 2025

    2025   ·   multimodal-ai   ·   research

  • Introducing TACO - Salesforce AI Research's Family of Multimodal Action Models - Salesforce

    We present TACO, a family of multi-modal large action models designed to improve performance on complex questions that require multiple capabilities and demand multi-step solutions. 

    7 min read   ·   January 16, 2025   ·   salesforce.com

    2025

  • Newer
  • 1
  • 2
  • 3
  • Older
© Copyright 2025 Juan Carlos Niebles. Last updated: June, 2025.