DOI: 10.3390/electronics15132832 ISSN: 2079-9292

Quantum-Accelerated Artificial Intelligence for Edge Devices: A Review of Encodings, Models, Hybrid Architectures, and NISQ-Era Realities

Rita Singh, Angel Deborah Suseelan

Edge artificial intelligence (Edge AI) requires real-time inference under stringent constraints on computation, memory, energy, and connectivity. Although training can be offloaded to servers, efficient, high-capacity inference and rapid on-device adaptation remain central challenges. Cloud-based inference offers substantial computational power but depends on connectivity, latency, privacy, and reliability conditions that edge deployments cannot always guarantee. Classical model-compression methods—including quantization, pruning, distillation, and neural architecture search—have extended the feasibility of on-device inference, yet they leave largely unchanged the fundamental cost of the linear-algebraic, sampling, and optimization primitives that dominate modern deep learning. Quantum computing has therefore been proposed as a complementary accelerator for selected AI workloads, with theoretical advantages in linear systems, singular value decomposition, sampling, kernel evaluation, and optimization. This review surveys the emerging field of quantum-accelerated AI for edge systems under a hybrid architectural premise: edge devices remain classical, while quantum processors operate as remote, cloud, MEC, or near-edge accelerators. We synthesize advances across quantum learning models, hybrid optimization methods, hardware and deployment architectures, and quantum-inspired approaches suitable for constrained devices. We also assess the practical barriers that currently separate asymptotic quantum advantage from deployable edge intelligence, including data loading, measurement overhead, noise, latency, and benchmarking gaps. Finally, we outline a staged research roadmap from near-term hybrid workflows to fault-tolerant and integrated quantum-edge architectures.

More from our Archive