Bala Kumaravel
I previously worked as a Senior Researcher at Microsoft Research, Redmond in the Interactive Multimodal AI Systems group, where I led and contributed to multimodal generative AI systems for productivity, collaboration, and creative tools across domains including Office Copilot workflows, Microsoft Teams, Bing Ads, and Minecraft agents.
I’m now building a startup focused on making the internet’s information more usable for humans and AI agents.
Before Microsoft, I completed my Ph.D. at the University of California, Berkeley, where I was advised by Prof. Björn Hartmann. My research at Berkeley focused on Virtual and Augmented Reality, spanning applications from AR/VR-assisted robotics interactions to learning and guidance. Before that, I completed my bachelor’s at the Indian Institute of Technology, Madras, where my thesis won both the best interdisciplinary thesis project across all engineering departments and the best thesis in the department.
If you’re exploring multimodal LLMs, diffusion models, or embodied AI to improve human-AI interaction, I’d love to connect.
news
| Date | News |
|---|---|
| Jul 15, 2025 | Our work, ‘Grounding Task Assistance with Multimodal Cues from a Single Demonstration’, was accepted to and presented at ACL’25 Findings. |
| Oct 21, 2024 | I will be speaking on a panel at the IEEE International Symposium on Emerging Metaverse on Oct 21, 2024. |
| Oct 16, 2024 | We presented our work on BlendScape and SpaceBlender at UIST 2024. BlendScape won an Honorable Mention Award at UIST 2024. Check out the works at BlendScape and SpaceBlender. |
| May 11, 2024 | We presented our work on SharedNeRF at CHI 2024. SharedNeRF won an Honorable Mention Award at CHI 2024. Check out the work at SharedNeRF. |
| Mar 16, 2024 | Moved to the Interactive Multimodal AI Systems team at Microsoft Research, Redmond |
selected publications
2026
2025
- Out of Sight, Not Out of Context? Egocentric Spatial Reasoning in VLMs Across Disjoint Frames. arXiv preprint arXiv:2505.24257, 2025
2024
- BlendScape: Enabling End-User Customization of Video-Conferencing Environments through Generative AI. In Proceedings of the 37th Annual ACM Symposium on User Interface Software and Technology, Jul 2024
- SpaceBlender: Creating Context-Rich Collaborative Spaces Through Generative 3D Scene Blending. In Proceedings of the 37th Annual ACM Symposium on User Interface Software and Technology, Jul 2024
- BlendScape: Enabling Unified and Personalized Video-Conferencing Environments through Generative AI. Jul 2024
2023
- StreamFunnel: Facilitating Communication Between a VR Streamer and Many Spectators. Jul 2023