Bala Kumaravel

prof_pic.png

I previously worked as a Senior Researcher at Microsoft Research, Redmond in the Interactive Multimodal AI Systems group, where I led and contributed to multimodal generative AI systems for productivity, collaboration, and creative tools across domains including Office Copilot workflows, Microsoft Teams, Bing Ads, and Minecraft agents.

I’m now building a startup focused on making the internet’s information more usable for humans and AI agents.

Before Microsoft, I completed my Ph.D. at the University of California, Berkeley, where I was advised by Prof. Björn Hartmann. My research at Berkeley focused on Virtual and Augmented Reality, spanning applications from AR/VR-assisted robotics interactions to learning and guidance. Before that, I completed my bachelor’s at the Indian Institute of Technology, Madras, where my thesis won both the best interdisciplinary thesis project across all engineering departments and the best thesis in the department.

If you’re exploring multimodal LLMs, diffusion models, or embodied AI to improve human-AI interaction, I’d love to connect.

news

Jul 15, 2025 Our work - ‘Grounding Task Assistance with Multimodal Cues from a Single Demonstration’ was accepted and presented at ACL’25 Findings link
Oct 21, 2024 I will be speaking at panel discussion at the IEEE International Symposium on Emerging Metaverse on Oct 21st 2024 link
Oct 16, 2024 We presented our work on BlendScape and SpaceBlender at UIST 2024. BlendScape won a Honorable Mention Award at UIST 2024. Check out the works at BlendScape and SpaceBlender.
May 11, 2024 We presented our work on SharedNeRF at CHI 2024. SharedNeRF won a Honorable Mention Award at CHI 2024. Check out the work at SharedNeRF.
Mar 16, 2024 Moved to the Interactive Multimodal AI Systems team at Microsoft Research, Redmond

selected publications

2026

  1. Proscenium.png
    Proscenium: Exploring Design Spaces of Layered Information Experience on a Large Dual-Layer Transparent Display
    Chen Chen, Michel Pahud, David Brown, Chuck Needham, Balasaravanan Thoravi KumaravelAndrew D Wilson, Ken Hinckley, and Nicolai Marquardt
    arXiv preprint arXiv:2603.01238, 2026
  2. MineNPC-Task.png
    MineNPC-Task: Task Suite for Memory-Aware Minecraft Agents
    Tamil Sudaravan Mohan Doss, Michael Xu, Sudha Rao, Andrew D Wilson, and Balasaravanan Thoravi Kumaravel
    arXiv preprint arXiv:2601.05215, 2026

2025

  1. outofsight.png
    Out of Sight, Not Out of Context? Egocentric Spatial Reasoning in VLMs Across Disjoint Frames
    Sahithya RaviGabriel SarchVibhav VineetAndrew D Wilson, and Balasaravanan Thoravi Kumaravel
    arXiv preprint arXiv:2505.24257, 2025
  2. mica.png
    Grounding Task Assistance with Multimodal Cues from a Single Demonstration
    Gabriel SarchBalasaravanan Thoravi KumaravelSahithya RaviVibhav Vineet, and Andrew D Wilson
    In Findings of the Association for Computational Linguistics: ACL 2025, Jul 2025
  3. DocToFuture.png
    Doc To The Future: Infomorphs for Interactive, Multimodal Document Transformation and Generation
    Balasaravanan Thoravi Kumaravel
    arXiv preprint arXiv:2602.23366, Jul 2025

2024

  1. blendscape.png
    BlendScape: Enabling End-User Customization of Video-Conferencing Environments through Generative AI
    Shwetha RajaramNels NumanBalasaravanan Thoravi Kumaravel, Nicolai Marquardt, and Andrew D Wilson
    In Proceedings of the 37th Annual ACM Symposium on User Interface Software and Technology, Jul 2024
  2. spaceblender.jpg
    SpaceBlender: Creating Context-Rich Collaborative Spaces Through Generative 3D Scene Blending
    Nels NumanShwetha RajaramBalasaravanan Thoravi Kumaravel, Nicolai Marquardt, and Andrew D Wilson
    In Proceedings of the 37th Annual ACM Symposium on User Interface Software and Technology, Jul 2024
  3. sharednerf.png
    SharedNeRF: Leveraging Photorealistic and View-dependent Rendering for Real-time and Remote Collaboration
    Mose SakashitaBalasaravanan Thoravi Kumaravel, Nicolai Marquardt, and Andrew David Wilson
    In Proceedings of the CHI Conference on Human Factors in Computing Systems, Jul 2024
  4. blendscape.png
    BlendScape: Enabling Unified and Personalized Video-Conferencing Environments through Generative AI
    Shwetha RajaramNels NumanBalasaravanan Thoravi Kumaravel, Nicolai Marquardt, and Andrew D Wilson
    In , Jul 2024

2023

  1. streamfunnel.png
    StreamFunnel: Facilitating Communication Between a VR Streamer and Many Spectators
    Haohua LyuCyrus VachhaQianyi ChenBalasaravanan Thoravi Kumaravel, and Bjoern Hartmann
    In , Jul 2023

2022

  1. crossroads.png
    Shaping the new future of work through mixed reality
    Balasaravanan Thoravi Kumaravel
    Jul 2022
  2. thesis.png
    Interactive Cross-Dimensional Media for Collaboration and Guidance in Mixed Reality Environments
    Balasaravanan Thoravi Kumaravel
    University of California, Berkeley, Jul 2022
  3. itsc.png
    Modeling and Influencing Human Attentiveness in Autonomy-to-Human Perception Hand-offs
    In 2022 IEEE 25th International Conference on Intelligent Transportation Systems (ITSC), Jul 2022
  4. dreamstream.gif
    DreamStream: Immersive and Interactive Spectating in VR
    Balasaravanan Thoravi Kumaravel, and Andrew D Wilson
    In Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems, Jul 2022

2020

  1. transceivr.jpg
    TransceiVR: Bridging asymmetrical communication between VR users and external collaborators
    Balasaravanan Thoravi KumaravelCuong NguyenStephen DiVerdi, and Bjoern Hartmann
    In Proceedings of the 33rd Annual ACM Symposium on User Interface Software and Technology, Jul 2020

2019

  1. tutorivr.jpg
    TutoriVR: A Video-Based Tutorial System for Design Applications in Virtual Reality
    Balasaravanan Thoravi KumaravelCuong NguyenStephen DiVerdi, and Bjoern Hartmann
    In CHI ’19: Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems, Jul 2019
  2. loki.jpg
    Loki: Facilitating remote instruction of physical tasks using bi-directional mixed-reality telepresence
    Balasaravanan Thoravi KumaravelFraser AndersonGeorge FitzmauriceBjoern Hartmann, and Tovi Grossman
    In Proceedings of the 32nd Annual ACM Symposium on User Interface Software and Technology, Jul 2019

Mentoring

Sahithya Ravi (PhD Student, UBC Vancouver) 2024
Gabriel Sarch (PhD Student, Carnegie Mellon University) 2024
Jialu Gao (Masters Student, Carnegie Mellon University) 2024
Cyrus Vaccha (Masters Student, UC Berkeley) 2023
Mose Sakashita (PhD Student, Cornell) 2023
Nels Numan (PhD Student, UCL) 2023
Shwetha Rajaram (PhD Student, UMich) 2023
Stephanie Claudino Daffara (Masters Student, UC Berkeley) 2020
Erin Kraemer (Bachelors Student, UC Berkeley) 2019