Juan Carlos Niebles

Research Director, Salesforce AI Research
Co-Director, Stanford Vision and Learning Lab
Adjunct Professor, CS Dept., Stanford University

bio

Juan Carlos Niebles is a Research Director at Salesforce and an Adjunct Professor of Computer Science at Stanford University, where he serves as co-Director of the Stanford Vision and Learning Lab. His research focuses on the intersection of computer vision, machine learning, multimodal AI, and autonomous agents.

With over 100 published articles in top-tier venues, Juan Carlos is a recognized leader in the global AI community. His career spans significant industry-academic leadership, including roles as Associate Director of Research for the Stanford-Toyota Center for AI Research, and Senior Research Scientist at the Stanford AI Lab. He also held a long-term professorship at Universidad del Norte in Colombia.

Beyond research, he helps shape the AI ecosystem as a member of the AI Index Steering Committee, Curriculum Director for Stanford-AI4ALL, and as an Area Chair for CVPR, ICCV, and ECCV. He also served as an Associate Editor for IEEE TPAMI. His contributions have been recognized with the Microsoft Research Faculty Fellowship, several Google Research Awards, and a Fulbright Fellowship. In 2025, he was named one of the Top 100 most prominent leaders in AI in Colombia and was previously a Forty Under Forty recipient.

He holds a Ph.D. degree in Electrical Engineering from Princeton University, an M.Sc. from the University of Illinois at Urbana-Champaign, and an Electronics Engineering degree from Universidad del Norte.

research

My research is centered on Computer Vision, with the ultimate goal of building multimodal AI systems that empower users through highly contextualized assistance. This requires a leap from passive observation to active partnership, beginning with event-aware perception to transform raw video into a structured understanding of human actions.

By bridging the gap between recognition and reasoning, my work seeks to infer human goals and intentions, allowing AI to move beyond simple labeling and toward anticipating a user’s needs in real time. These capabilities are fundamental to developing embodied agents that can navigate dynamic environments and perform meaningful, supportive tasks. I pursue scaling these multimodal technologies to solve high-impact, practical problems that enhance how humans and machines interact.

news

Feb 2026 I am a Lead Area Chair for ECCV 2026.
Nov 2025 I am a Lead Area Chair for CVPR 2026.
Oct 2025 I am an invited speaker and panelist at the ICCV 2025 Workshop on Multi-Modal Reasoning for Agentic Intelligence. Check out my slides here.

selected publications

  1. Strefer: Empowering Video LLMs with Space-Time Referring and Reasoning via Synthetic Instruction Data
    Honglu Zhou, Xiangyu Peng, Shrikant Kendre, Michael S Ryoo, Silvio Savarese, Caiming Xiong, and Juan Carlos Niebles
    In ICCV Workshop on What is Next in Multimodal Foundation Models? Honolulu, Hawaii. Oct 2025
  2. LAM Simulator: Advancing Data Generation for Large Action Model Training via Online Exploration and Trajectory Feedback
    Thai Quoc Hoang, Kung-Hsiang Huang, Shirley Kokane, Jianguo Zhang, Zuxin Liu, Ming Zhu, Jake Grigsby, Tian Lan, Michael S Ryoo, Chien-Sheng Wu, and 5 more authors
    In ACL Findings. Vienna, Austria. Jul 2025
  3. ViUniT: Visual Unit Tests for More Robust Visual Programming
    Artemis Panagopoulou, Honglu Zhou, Silvio Savarese, Caiming Xiong, Chris Callison-Burch, Mark Yatskar, and Juan Carlos Niebles
    In IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Nashville, Tennessee. Jun 2025

all publications