Current Location: > Detailed Browse

Empowering Large Language Models to Edge Intelligence: A Survey of Edge Efficient LLMs and Techniques

请选择邀稿期刊:
Abstract: Large language models (LLMs) have showcased exceptional capabilities across various natural language processing (NLP) tasks in recent years, such as machine translation, text summarization, and question answering. Despite their impressive performance, the deployment of these models on edge devices, such as mobile phones, IoT devices, and edge computing nodes, is significantly hindered by their substantial computational and memory requirements. This survey provides a comprehensive overview of the state-of-the-art techniques and strategies for enabling efficient inference of LLMs on edge devices. We explore approaches including the development of small language models (SLMs), model compression techniques, inference optimization strategies, and dedicated frameworks for edge deployment. Our goal is to highlight the advancements and ongoing challenges in this field, offering valuable insights for researchers and practitioners striving to bring the power of LLMs to edge environments.

Version History

[V1] 2024-11-25 10:02:34 ChinaXiv:202411.00258V1 Download
Download
Preview
Peer Review Status
Awaiting Review
License Information
metrics index
  •  Hits2597
  •  Downloads864
Comment
Share
Apply for expert review