CSE 195 / EECS 295: EECS Research

Fall 2026

Wednesdays, 11:30 AM – 12:30 PM

Aug 26

Introductions
Sep 2

nuScenes: A Multimodal Dataset for Autonomous Driving

Presented by Erika
Sep 9

nuPlan: A Closed-Loop ML-Based Planning Benchmark for Autonomous Vehicles

Presented by Parthib
Sep 16

No meeting
Sep 23

Alpamayo

Presented by Maitrayee
Sep 30

123D: Unifying Multi-Modal Autonomous Driving Data at Scale

Presented by Angel
Oct 7

Emerging Properties in Self-Supervised Vision Transformers

Presented by Kianna
Oct 14

Open-World Semantic Segmentation Including Class Similarity

Presented by Maitrayee
Oct 21

Reducing Network Agnostophobia

Presented by David
Oct 28

A Simple Framework for Contrastive Learning of Visual Representations

Presented by Ricardo
Nov 4

BEV-LLM: Leveraging Multimodal BEV Maps for Scene Captioning in Autonomous Driving

Presented by Giovanni
Nov 11

Senna: Bridging Large Vision-Language Models and End-to-End Autonomous Driving

Presented by Angel
Nov 18

Robust Speech Recognition via Large-Scale Weak Supervision

Presented by Wesley
Nov 25

No meeting
Dec 2

Token-Efficient Long Video Understanding for Multimodal LLMs

Presented by Kianna
Dec 9

Wolf: Dense Video Captioning with a World Summarization Framework

Presented by David

Exploring the Potential of Multi-Modal AI for Driving Hazard Prediction

Presented by Angel
Delving Into Multi-Modal Multi-Task Foundation Models for Road Scene Understanding: From Learning Paradigm Perspectives

Presented by Ricardo
Learning Transferable Visual Models From Natural Language Supervision

Presented by Kianna
Lessons in Cooperation: Driver Sentiments Toward Real-Time Advisory Systems

Presented by Tiep

Sep 3

LERF: Language Embedded Radiance Fields

Presented by Shyam
Sep 10

Accessibility for Whom?

Presented by Wesley
Sep 24

DriveLLaVA: Human-Level Behavior Decisions via Vision Language Model

Presented by Kianna
Oct 1

DriveLM: Driving with Graph Visual Question Answering

Presented by Angel

Jan 28

Hydra: Foundations of Spatial Perception for Robotics — Hierarchical Representations and Real-time Systems

Presented by Aryan
MAPLM: A Real-World Large-Scale Vision-Language Benchmark for Map and Traffic Scene Understanding

Presented by Shashank
Rank2Tell: A Multimodal Driving Dataset for Joint Importance Ranking and Reasoning

Presented by Srini

Feb 11

Tutorial / Demo Using UCM Pinnacles

Presented by Shashank
Feb 18

Trimodal Contrastive Loss for Text-to-Shape Retrieval

Presented by Guillermo
Mar 11

ConceptGraphs: Open-Vocabulary 3D Scene Graphs for Perception and Planning

Presented by Srini
Mar 18

ViNT: A Foundation Model for Visual Navigation

Presented by Harsha
Apr 1

GNM: A General Navigation Model to Drive Any Robot

Presented by Parthib
Apr 8

SpatialRGPT: Grounded Spatial Reasoning in Vision-Language Models

Presented by Aryan
Apr 15

LERF: Language Embedded Radiance Fields

Presented by Shashank
Oct 29

Clio: Real-time Task-Driven Open-Set 3D Scene Graphs

Presented by Aryan