Shobha Sundar Ram did her Bachelor of Technology in ECE from the University of Madras, India, in 2004, Master of Science and Ph.D. in electrical engineering from the University of Texas at Austin, USA, in 2006 and 2009, respectively. She worked as a research and development electrical engineer at Baker Hughes Inc., USA, from 2009 to 2013. She joined Indraprastha Institute of Information Technology as an Assistant Professor in 2013, where she is currently a Professor in the Department of Electronics and Communications Engineering. She is primarily engaged in research and education, principally in the areas of radar systems and electromagnetic sensor design and modeling. She serves as an Associate
Editor for the IEEE Transactions on Aerospace and Electronics Systems, IEEE Transactions on Radar Systems, and is a Guest Editor for IEEE Transactions on Computational Imaging. She has served as a Guest Editor for IET Radar, Sonar, and Navigation and Topics Editor for Frontiers of Signal Processing. She currently serves as the Secretary of the IEEE Microwave Theory and Techniques Society (MTT-S). She is a Vice-Chair of the Young Professionals Standing Committee under the MTT-S Administrative Committee. She is a member of the following technical committees: the Radar System Panel of the IEEE Aerospace and Electronics Systems Society (AESS), the Microwave/mm-Wave Radar, Sensing and Array Systems Technical Committee -24 of MTT-S, and the Sensor Array and Multichannel (SAM) Technical Committee of the IEEE Signal Processing Society (SPS). She is also a member of the following working groups: the Integrated Sensing and Communications Working Groups of MTT-S and AESS, and the Synthetic Aperture Working Group of SPS. She has served in the Technical Program Committee of several international conferences and was the Technical Program Committee Co-Chair of the IEEE Microwave, Antennas, and Propagation Conference (MAPCON) in 2024. She has won several student paper awards for co-authored works in the USA, Europe, and India, and the Qualcomm Innovation Fellowship India award for 2022 and 2024.
Abstract: A key objective of next generation intelligent transportation services is to enable high data rate vehicular communications to enable sharing of road and environmental information for improved road safety. Existing sub-6GHz vehicular communication protocols do not support the large bandwidths required for high throughput and low latency communications. Millimeter wave (mmW) IEEE 802.11ad protocol with narrow high-directional beams has been proposed for supporting high bandwidth connected vehicles. The standard IEEE 802.11ad protocol enables beam alignment between the base station (BS) and mobile unit (MU) through a lengthy beam training procedure accomplished through additional packet overhead. However, this results in reduced latency and throughput
In this talk, I will present an integrated sensing and communications (ISAC) system solution where the communication protocol is embedded with auxiliary radar functionality to enable rapid beam alignment of communication beams without the requirement of channel overheads. Further, I will present a complete architectural framework of the 802.11ad-based ISAC system transceiver along with the signal processing algorithms and performance analysis on simulated target scenarios. Next, I will present the hardware prototype of the ISAC system comprising the three-dimensional efficient reconfigurable digital hardware radar signal processing architecture and communication system on multiprocessor system-on-chip; an mmWanalog front end integrated with the digital frontend; and over-the-air performance characterization in real radio environments.
Raj Kumar received the B.E. degree from the University of Madras, the M.Sc.(Eng.) degree from the Indian Institute of Science, Bangalore, and the Ph.D. degree from the University of Southern California (USC), Los Angeles (specializing in coding and information theory). He was subsequently a post-doctoral researcher at the École Polytechnique Fédérale de Lausanne (EPFL, Switzerland) for a couple of years, and has been with Qualcomm’s Wireless R&D group in Bangalore since 2011. At Qualcomm, he has worked on a wide variety of projects spanning algorithm design, standardization and commercialization of 3G to 6G cellular systems, WiFi, WiGig (60GHz mmWave) and low earth orbit satellite communication systems, as well as advanced positioning and wireless sensing (ISAC) solutions.
Abstract: Wireless sensing is emerging as a key capability in next-generation networks, enabling a wide range of use cases such as device-free localization, activity monitoring, smart infrastructure, and digital twins. The talk will highlight the standards evolution from 5G-Advanced toward 6G ISAC as a native platform capability. It will also present real-world prototype results, demonstrating accurate sensing across various scenarios. These results illustrate the transition from research to deployable systems for future wireless networks.
Kumar Vijay Mishra (S’08-M’15-SM’18) obtained a Ph.D. in electrical engineering and M.S. in mathematics from The University of Iowa in 2015, and M.S. in electrical engineering from Colorado State University in 2012, while working on NASA’s Global Precipitation Mission Ground Validation (GPM-GV) weather radars. He received his B. Tech. summa cum laude (Gold Medal, Honors) in electronics and communication engineering from the National Institute of Technology, Hamirpur (NITH), India in 2003. He is a Senior Fellow at the United States DEVCOM Army Research Laboratory; Research Scientist at the Institute for Systems Research, The University of Maryland, College Park under the ARL-ArtIAMAS program; 2026 BEL Endowed Visiting Chair Professor in Radar Systems at the Indian Institute of Science, Bangalore; Technical Adviser to Singapore-based automotive radar start-up Hertzwell; and honorary Research Fellow at SnT - Interdisciplinary Centre for Security, Reliability and Trust, University of Luxembourg. Previously, he had research appointments at the Electronics and Radar Development Establishment (LRDE), Defence Research and Development Organisation (DRDO) Bengaluru; IIHR - Hydroscience & Engineering, Iowa City, IA; Mitsubishi Electric Research Labs, Cambridge, MA; Qualcomm, San Jose; and Technion - Israel Institute of Technology.
Dr. Mishra has served as the Distinguished Lecturer (DL) of various societies: IEEE Communications Society (2023-2024), IEEE Aerospace and Electronic Systems Society (AESS) (2023-2024, 2025-2026), IEEE Vehicular Technology Society (2023-2025, 2025-2027), and IEEE Geoscience and Remote Sensing Society (2024-2025). He has been a Virtual DL of IEEE Future Networks Initiative (2022) and Traveling Lecturer of Optica (2025-). He is the recipient of the IEEE AESS Harry Rowe Mimno Award (2026), SAE International Award for Excellence in Innovation (2025), IEEE Signal Processing Society Pierre-Simon Laplace Early Career Technical Achievement Award (2024), Special Mention for the IEEE AESS M. Barry Carlton Award (2023), IET Premium Best Paper Prize (2021), IEEE T-AES Outstanding Editor (2021, 2023, 2024, 2025), U. S. National Academies Harry Diamond Distinguished Fellowship (2018-2021), American Geophysical Union Editors' Citation for Excellence (2019), Royal Meteorological Society Quarterly Journal Editor's Prize (2017), Viterbi Postdoctoral Fellowship (2015, 2016), Lady Davis Postdoctoral Fellowship (2017), DRDO LRDE Scientist of the Year Award (2006), NITH Director’s Gold Medal (2003), and NITH Best Student Award (2003). He has received Best Paper Awards at IEEE MLSP 2019 and IEEE ACES Symposium 2019.
Dr. Mishra is Chair (2023-2026) of the International Union of Radio Science (URSI) Commission C, Chair (2025-) of IEEE AESS Technical Working Group on Integrated Sensing and Communications (ISAC-TWG), and Vice-Chair (2021-present) of the IEEE Synthetic Aperture Standards Committee, which is the first SPS standards committee. He has been Chair (2023-2025) of the IEEE SPS Synthetic Apertures Technical Working Group. He has been an elected member of three technical committees of IEEE SPS: SPCOM, SAM, and ASPS, and IEEE AESS Radar Systems Panel. He is Editor-in-Chief (EiC) of River Rapids Series in Radar Systems, Signal Processing, Antennas and Electromagnetics (2025-), Editor (Deputy EiC) of Radio Science (2025-), Senior Area Editor of IEEE Transactions on Signal Processing (2024-), and Associate Editor of IEEE Transactions on Aerospace and Electronic Systems (2020-) and IEEE Transactions on Antennas and Propagation (2023-). He has been a lead/guest editor of several special issues in journals such as IEEE Signal Processing Magazine, IEEE Journal of Selected Topics in Signal Processing, IEEE Journal on Selected Areas in Communications, and IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing. He is the lead co-editor of several books on signal processing and radar: Signal Processing for Joint Radar-Communications (Wiley-IEEE Press, 2024), Next-Generation Cognitive Radar Systems (IET Press Electromagnetics and Radar Series, 2023), Advances in Weather Radar Volumes 1, 2 & 3 (IET Press Electromagnetics and Radar Series, 2023), and Handbook of Statistics 54: Multidimensional Signal Processing (Elsevier). His research interests include radar systems, signal processing, remote sensing, and electromagnetics.
Abstract: Electromagnetic waves are characterized by their amplitude, frequency, and state of polarization -- also referred to as spin angular momentum (SAM), associated with circular polarization and recognized for over two centuries. In contrast, over the past three decades, growing attention has been directed toward orbital angular momentum (OAM), which arises from helical (twisted) phase fronts and provides an additional degree of freedom through its orthogonal modal structure. In this talk, we explore the use of OAM for integrated sensing and communications (ISAC), with the goal of enhancing spectral efficiency and spatial resolution in automotive scenarios. Unlike conventional uniform circular array (UCA)-based implementations, our approach employs a uniform linear array (ULA) with traveling-wave antennas to synthesize multiple Laguerre-Gaussian vortex beams. The proposed system embeds communication data within radar waveforms while enabling joint estimation of target position and velocity using radar-only frames. An OAM-based mode-division multiplexing strategy is used to separate sensing and communication functionalities, ensuring reliable parameter recovery. We conclude with perspectives on the future of ISAC systems. [Joint work with Wanghan Lv]
S. Umesh is a Professor of Electrical Engineering at IIT Madras, where he leads the SPRING Lab. He and Prof. Hema Murthy serve as the National Coordinators of Speech Consortium for the Ministry of Electronics and Information Technology (MeitY) Bhashini initiative, overseeing speech and language technology development across 23 institutions in India. Umesh received his Ph.D. from the University of Rhode Island in 1993, followed by a postdoctoral fellowship at the City University of New York. Before joining IIT Madras, he spent over a decade at IIT Kanpur. He has also held visiting research positions at AT&T Research Labs, Cambridge University, and RWTH Aachen University. His research focuses on self-supervised learning for speech, speech representation learning, speaker normalization, and low-resource modelling, with applications to automatic speech recognition. He has also worked in the areas of statistical signal processing and time-varying spectral analysis.
Abstract: To address these limitations, we developed two new SSL models — cccWav2vec2.0 and Data2vec‑aqc — using noise augmentation, improved negative sampling, and cross‑contrastive objectives. On the SUPERB benchmark, they rank 5th and 6th while using only 1/60th of the training data and lesser parameters than many existing models. We further trained both models from scratch on 30,000 hours of Indian‑language speech and released them publicly. Fine‑tuning across 16 Indian languages yields state‑of‑the‑art ASR results.
Kruthika KR Dr. Kruthika KR is a Founding Researcher at SandLogic, where she works on building foundational AI models for speech and language applications. She is a co-author of Shakti LLM, a family of efficient large language models designed for enterprise and edge deployment. Her work focuses on speech AI, large language model architecture, and scalable deployment of AI systems across cloud and on-device environments. She specializes in developing production-ready AI models powering voice agents, enterprise copilots, and real-time AI applications.
Abstract: We propose a robust and computationally efficient architecture for Indic conversational automatic speech recognition (ASR), designed for real-world multi-accent, noisy, and spontaneous speech scenarios commonly observed in customer-agent calls and voice applications. Our work combines large-scale Hindi speech curation from ARTPARK–IISc Project Vaani with attention-free state-space sequence modeling to improve robustness, scalability, and deployment efficiency. We first curate a high-quality multi-accent Hindi subset (~366 hours) from the Vaani corpus through language filtering, transcript validation, normalization, and conversational data selection. Using this corpus, we fine-tune a Whisper-based streaming low-latency speech-to-text (SL-STT) baseline to better handle noisy and accent-diverse conversational speech. The resulting system achieves a ~21% relative reduction in WER on the LAHAJA benchmark and 50–55% relative gains on production-like customer-agent call datasets, highlighting strong generalization under realistic deployment conditions. Building on this baseline, we introduce a BiMamba-ASR architecture for Hindi speech recognition as an efficient alternative to transformer-based attention mechanisms. The proposed model consists of a convolutional acoustic encoder, followed by bidirectional Mamba residual blocks that capture long-range temporal dependencies with linear-time complexity, and a unidirectional Mamba decoder enabling constant-time token generation during inference. This design provides significant advantages in memory efficiency, long-context scalability, and streaming suitability, making it well aligned with edge and low-latency deployment requirements. A key architectural contribution is a Mamba-based cross-modal fusion module that replaces conventional cross-attention for audio-text interaction. The module incorporates learnable temporal alignment, adaptive fusion of global and local acoustic context, and gated linguistic conditioning, enabling effective integration of acoustic and textual representations while preserving computational efficiency. This attention-free fusion mechanism forms the central novelty of the proposed architecture. We further validate the robustness of the system on a real-world Hindi benchmark (~5 hours, 227 samples) curated from diverse YouTube sources containing varied accents, speaking styles, and noisy acoustic environments. The proposed SL-STT system achieves a WER of 16.5%, approaching commercial systems such as Deepgram Nova-2 (13.8%), demonstrating strong real-world performance. Beyond ASR, the same efficient design principles are extended to lightweight Indic TTS systems Model for languages such as Bhojpuri, Marathi, and Bengali using the Syspin dataset, showing the broader applicability of scalable speech architectures across Indic voice technologies. Overall, our work demonstrates that large-scale curated Indic datasets combined with efficient BiMamba-based attention-free architectures offer a practical and scalable path toward production-grade speech systems for real-world multilingual deployment.
Sourav Bandyopadhyay is the Co-founder and Chief Scientist of Shunyalabs Research Private Limited and a PhD Scholar at IIT Kharagpur, working at the intersection of artificial intelligence, language, and fundamental theory. With an exceptional track record that includes 12 patents, 18 research publications, and 23 world records, his work challenges conventional thinking in AI, most notably his assertion that large language models are inherently prone to hallucination, grounded in Gödel’s Incompleteness Theorem. Driven by a deep obsession with correctness and intellectual rigor, Sourav brings a rare blend of theoretical insight and real-world application. Beyond research, he is also an active political strategist, applying analytical thinking to complex societal systems. His work consistently pushes the boundaries of what reliable, trustworthy AI can and should be.
Abstract: No standardized annotation format exists for stuttered speech, preventing data pooling across datasets and limiting ASR improvement for the roughly 80 million people worldwide who stutter. Existing datasets (SEP-28k, FluencyBank, UCLASS, KSoF) use incompatible schemes with no shared severity model and no schema validation. We present SAML (Stuttering Annotation Markup Language), a transcript-level annotation system comprising a formal XML Schema (XSD) extending W3C SSML with 12 element types, a shorthand notation that reduces annotation effort by about 80%, and a reference Python converter with schema validation. The severity scale is aligned with the SSI-4 clinical instrument. We show that SAML covers the union of annotation features across all four datasets while adding percent severity, typed blocks, and machine-checkable constraints. All artifacts are released under the W3C Software and Document License.
Anand Deo is an Assistant Professor with the Decision Science at the Indian Institute of Management Bangalore. His research focuses on reliable decision making in the presence of extreme, high-impact events. Specific areas of interest include modelling and simulation of distribution tails, chance constrained optimisation, distributionally robust problems and quantitative finance. His work has appeared in leading journals and conferences in Operations Research, Financial Engineering and Control Theory. He won the 2024 I-SIM Best Publication award for his work on rare event simulation
Abstract: Chance-constrained optimization is a suitable modeling framework for safety-critical applications where violating constraints is nearly unacceptable. The scenario approach is a popular solution method for these problems, due to its straightforward implementation and ability to preserve problem structure. However, in the rare-event regime where constraint violations must be kept extremely unlikely, the scenario approach becomes computationally infeasible due to the excessively large sample sizes it demands. We address this limitation with a new yet straightforward decision-scaling method that relies exclusively on original data samples and a single scalar hyperparameter that scales the constraints in a way amenable to standard solvers. Our method leverages large deviation principles under mild nonparametric assumptions satisfied by commonly used distribution families in practice. For a broad class of problems satisfying certain practically verifiable structural assumptions, the method achieves a polynomial reduction in sample size requirements compared to the classical scenario approach, while also guaranteeing asymptotic feasibility in the rare-event regime. Numerical experiments spanning engineering applications show that our decision-scaling method significantly expands the scope of problems solvable both efficiently and reliably.
Pranay Sharma is an Assistant Professor at IIT Bombay in the Centre for Machine Intelligence and Data Science (C-MInDS). Till January 2025, he was a Research Scientist in the Department of Electrical and Computer Engineering at Carnegie Mellon University. In August 2021, he finished his PhD in Electrical Engineering and Computer Science at Syracuse University. Before that, he finished his B.Tech-M.Tech dual-degree in Electrical Engineering from IIT Kanpur. His research interests include federated and collaborative learning, stochastic optimization, reinforcement learning, and differential privacy.
Abstract:
Online federated learning (OFL) enables decentralized, private decision-making over continuous data streams. However, standard OFL adversary models often preclude parallelization benefits and fail to adequately capture diverse statistical variations. To address this, we integrate a stochastically extended adversary (SEA) in OFL, where the loss function remains fixed, but the adversary dynamically and independently selects each client's data distribution at each time step
We propose the FedSEA algorithm, utilizing online stochastic gradient descent at clients with periodic global server aggregation. We establish global network regret bounds over time horizon $T$: $O(\sqrt{T})$ for smooth, convex losses and $O(\log T)$ for smooth, strongly convex losses. By quantifying the individual impact of spatial and temporal data heterogeneity on these bounds, we identify a regime of mild temporal variation where network regret improves with parallelization, thereby improving upon the pessimistic worst-case results in standard OFL.