CSE 195 / EECS 295: EECS Research
Fall 2026
Wednesdays, 11:30 AM – 12:30 PM
-
Introductions
-
nuScenes: A Multimodal Dataset for Autonomous DrivingPresented by Erika
-
nuPlan: A Closed-Loop ML-Based Planning Benchmark for Autonomous VehiclesPresented by Parthib
-
No meeting
-
AlpamayoPresented by Maitrayee
-
123D: Unifying Multi-Modal Autonomous Driving Data at ScalePresented by Angel
-
Emerging Properties in Self-Supervised Vision TransformersPresented by Kianna
-
Open-World Semantic Segmentation Including Class SimilarityPresented by Maitrayee
-
Reducing Network AgnostophobiaPresented by David
-
A Simple Framework for Contrastive Learning of Visual RepresentationsPresented by Ricardo
-
BEV-LLM: Leveraging Multimodal BEV Maps for Scene Captioning in Autonomous DrivingPresented by Giovanni
-
Senna: Bridging Large Vision-Language Models and End-to-End Autonomous DrivingPresented by Angel
-
Robust Speech Recognition via Large-Scale Weak SupervisionPresented by Wesley
-
No meeting
-
Token-Efficient Long Video Understanding for Multimodal LLMsPresented by Kianna
-
Wolf: Dense Video Captioning with a World Summarization FrameworkPresented by David
Spring 2026
-
Exploring the Potential of Multi-Modal AI for Driving Hazard PredictionPresented by Angel
-
Delving Into Multi-Modal Multi-Task Foundation Models for Road Scene Understanding: From Learning Paradigm PerspectivesPresented by Ricardo
-
Learning Transferable Visual Models From Natural Language SupervisionPresented by Kianna
-
Lessons in Cooperation: Driver Sentiments Toward Real-Time Advisory SystemsPresented by Tiep
Fall 2025
-
LERF: Language Embedded Radiance FieldsPresented by Shyam
-
Accessibility for Whom?Presented by Wesley
-
DriveLLaVA: Human-Level Behavior Decisions via Vision Language ModelPresented by Kianna
-
DriveLM: Driving with Graph Visual Question AnsweringPresented by Angel
Spring 2025
-
Hydra: Foundations of Spatial Perception for Robotics — Hierarchical Representations and Real-time SystemsPresented by Aryan
-
MAPLM: A Real-World Large-Scale Vision-Language Benchmark for Map and Traffic Scene UnderstandingPresented by Shashank
-
Rank2Tell: A Multimodal Driving Dataset for Joint Importance Ranking and ReasoningPresented by Srini
Fall 2024
-
Tutorial / Demo Using UCM PinnaclesPresented by Shashank
-
Trimodal Contrastive Loss for Text-to-Shape RetrievalPresented by Guillermo
-
ConceptGraphs: Open-Vocabulary 3D Scene Graphs for Perception and PlanningPresented by Srini
-
ViNT: A Foundation Model for Visual NavigationPresented by Harsha
-
GNM: A General Navigation Model to Drive Any RobotPresented by Parthib
-
SpatialRGPT: Grounded Spatial Reasoning in Vision-Language ModelsPresented by Aryan
-
LERF: Language Embedded Radiance FieldsPresented by Shashank
-
Clio: Real-time Task-Driven Open-Set 3D Scene GraphsPresented by Aryan