学院新闻

讲座信息

计算机学院系列讲座菁英论坛第29期——Multisensory Machine Intelligence

信息日期：2024-05-08 浏览量：

计算机学院系列讲座菁英论坛第29期——Multisensory Machine Intelligence

报告题目(Title)：Multisensory Machine Intelligence

时间(Date & Time)：2024.05.13 6：30 - 8：00 pm

地点(Location)：二教207（燕园校区） Room 207, Teaching Building #2 (Yanyuan Campus)

主讲人(Speaker)：Ruohan Gao (高若涵) from University of Maryland, College Park (UMD)

邀请人(Host)：Sheng Li (李胜)

报告摘要(Abstract)：

The future of Artificial Intelligence demands a paradigm shift towards multisensory perception—to systems that can digest ongoing multisensory observations, that can discover structure in unlabeled raw sensory data, and that can intelligently fuse useful information from different sensory modalities for decision making. While we humans perceive the world by looking, listening, touching, smelling, and tasting, traditional form of machine intelligence mostly focuses on a single sensory modality, particularly vision. My research aims to teach machines to see, hear, and feel like humans to perceive, understand, and interact with the multisensory world. In this talk, I will present my research of multisensory machine intelligence that studies two important aspects of the multisensory world: 1) multisensory objects, and 2) multisensory space. In both aspects, I will talk about how we design systems to reliably capture multisensory data, how we effectively model them with new differentiable simulation algorithms and deep learning models, and how we explore creative cross-modal/multi-modal applications with sight, sound, and touch.

主讲人简介(Bio)：

A person in a blue shirtDescription automatically generated

Dr. Ruohan Gao is an incoming assistant professor of the CS Department at University of Maryland, College Park, and currently a research scientist at Meta Reality Labs. Previously, he was a Postdoctoral Research Fellow working with Prof. Fei-Fei Li, Prof. Jiajun Wu, and Prof. Silvio Savarese in the Vision and Learning Lab at Stanford University. He obtained his Ph.D. advised by Prof. Kristen Grauman from The University of Texas at Austin. Ruohan mainly works in the fields of computer vision and machine learning with a specific focus on multisensory learning with sight, sound, and touch. His research has been recognized by the Michael H. Granof Award which is designated for UT Austin's Top 1 Doctoral Dissertation, the Google PhD Fellowship, the Adobe Research Fellowship, a Best Paper Award Runner Up at British Machine Vision Conference (BMVC) 2021, and a Best Paper Award Finalist at Conference on Computer Vision and Pattern Recognition (CVPR) 2019.

上一条：计算机学院系列讲座菁英论坛第30期——Towards Prevalence of On-Device AI with Full Runtime Adaptability

下一条：计算机学院系列讲座菁英论坛28期——End-to-End Mechanized Proof of an eBPF Virtual Machine for Micro-controllers

返回列表

请输入您搜索的信息！

学院新闻

学院新闻

讲座信息

计算机学院系列讲座菁英论坛第29期——Multisensory Machine Intelligence

信息日期：2024-05-08 浏览量：_showDynClicks("wbnews", 1934453449, 3014)

上一条：计算机学院系列讲座菁英论坛第30期——Towards Prevalence of On-Device AI with Full Runtime Adaptability

下一条：计算机学院系列讲座菁英论坛28期——End-to-End Mechanized Proof of an eBPF Virtual Machine for Micro-controllers

信息日期：2024-05-08 浏览量：