
Boosting AI performance at the edge requires the right processors and storage solutions

Artificial intelligence is moving away from GPU-reliant, data center-dominated models towards energy-efficient, low-power alternatives designed specifically for edge devices.


The focus on efficient compute architectures and specialized AI models for power-constrained edge computing systems is gaining momentum. This shift is crucial for enabling practical AI and real-time processing directly on edge hardware such as IoT devices and smart cameras.

One of the key trends is the shift towards efficient, low-power AI models tailored for the edge. GPU-heavy models trained and served in the data center are giving way to specialized AI models optimized for on-device inference in power- and memory-constrained environments. These models aim for maximum performance-per-watt, often measured in tera-operations per second per watt (TOPS/W), balancing memory usage and compute to fit limited edge hardware.
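As a minimal sketch of how this figure of merit is computed, the snippet below compares two hypothetical devices; the TOPS and wattage numbers are illustrative assumptions, not vendor specifications.

```python
# Performance-per-watt (TOPS/W) for two hypothetical devices.
# All figures below are illustrative assumptions, not vendor specs.

def tops_per_watt(tera_ops_per_second: float, power_watts: float) -> float:
    """Efficiency figure of merit: tera-operations per second per watt."""
    return tera_ops_per_second / power_watts

devices = {
    "data-center GPU (assumed)": (250.0, 300.0),   # 250 TOPS at 300 W
    "edge accelerator (assumed)": (26.0, 2.5),     # 26 TOPS at 2.5 W
}

for name, (tops, watts) in devices.items():
    print(f"{name}: {tops_per_watt(tops, watts):.1f} TOPS/W")
```

On these assumed numbers the edge part delivers roughly an order of magnitude more TOPS per watt, which is exactly the trade-off driving the shift described above.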

Another trend involves rethinking compute architectures to reduce latency and bandwidth needs. Edge computing infrastructure minimizes latency by bringing compute closer to the data source, which also cuts down on extensive data movement and bandwidth consumption. This is critical for applications requiring ultra-low latency, such as AR/VR, autonomous systems, and remote surgery.
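A back-of-envelope comparison makes the latency argument concrete. The timings below are assumptions chosen for illustration, not measurements.

```python
# Back-of-envelope latency: cloud round trip vs. on-device inference.
# All timings are illustrative assumptions, not measurements.

network_rtt_ms = 40.0   # assumed round trip to a regional data center
cloud_infer_ms = 5.0    # assumed inference time on a data-center GPU
edge_infer_ms = 15.0    # assumed inference time on an edge accelerator

cloud_total_ms = network_rtt_ms + cloud_infer_ms
edge_total_ms = edge_infer_ms

print(f"cloud path: {cloud_total_ms:.0f} ms (dominated by the network)")
print(f"edge path:  {edge_total_ms:.0f} ms")

# A 60 fps AR/VR pipeline has roughly a 16.7 ms budget per frame;
# on these numbers only the on-device path fits inside it.
```

Even a modestly slower edge accelerator wins once the network round trip is removed, because the network term dominates the cloud path.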

Architectures combine processing and storage intelligently to minimize data transfer bottlenecks and optimize memory bandwidth locally, often using hyperconverged edge systems that integrate compute, storage, and network resources to improve performance and resilience.

The processors deployed at the edge are engineered specifically to balance compute throughput and memory bandwidth against low energy consumption. These may include specialized AI accelerators that handle memory-intensive operations more efficiently than general-purpose CPUs or GPUs, sustaining fast model inference despite constrained memory bandwidth.
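A simple roofline-style estimate, sketched below, illustrates why memory bandwidth rather than raw TOPS often limits edge inference; the hardware and model figures are hypothetical.

```python
# Roofline-style check: is an inference workload compute- or memory-bound?
# Hardware and model figures are hypothetical, for illustration only.

peak_tops = 26.0     # accelerator peak compute, tera-ops/s (assumed)
mem_bw_gbs = 34.0    # memory bandwidth, GB/s (assumed LPDDR-class)

model_gops = 8.0     # ops per inference, giga-ops (assumed)
model_bytes = 50e6   # ~50 MB of weights + activations per inference (assumed)

compute_time_ms = model_gops * 1e9 / (peak_tops * 1e12) * 1e3
memory_time_ms = model_bytes / (mem_bw_gbs * 1e9) * 1e3

bound = "memory" if memory_time_ms > compute_time_ms else "compute"
print(f"compute time: {compute_time_ms:.2f} ms, memory time: {memory_time_ms:.2f} ms")
print(f"on the assumed hardware this workload is {bound}-bound")
```

Because the memory term dominates in this example, raising local bandwidth (or keeping data close to the compute) buys more latency reduction than adding raw compute, which is precisely the bottleneck these accelerators target.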

The use of hyperconverged systems and distributed edge architectures provides scalable, fault-tolerant infrastructure that can perform consistently in diverse and often power-limited edge environments. These systems aim to optimize resource use, managing both compute and memory intelligently to ensure performance without exceeding power budgets.

A prime example of this advancement is the collaboration between Micron and Hailo. Micron's LPDDR4X, well suited to Hailo's VPU, delivers high-bandwidth data rates without compromising power efficiency. The combination of Micron's LPDDR technology and Hailo's AI processors enables a broad range of applications, from industrial and automotive to enterprise systems.

For embedded AI applications, Micron's LPDDR technology offers high-speed, high-bandwidth data transfer without sacrificing power efficiency. As the industry continues to shift towards more efficient compute architectures and specialized AI models for low-power applications, Micron's 1-beta LPDDR5X doubles the performance of LPDDR4, reaching up to 9.6 Gb/s per pin, and delivers 20% better power efficiency than LPDDR4X.
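To put the per-pin figure in context, the sketch below converts it into aggregate channel bandwidth. The 9.6 Gb/s rate comes from the article; the bus width is an assumption for illustration.

```python
# Converting a per-pin data rate into aggregate memory bandwidth.
# The 9.6 Gb/s per-pin rate is from the article; the bus width is assumed.

pin_rate_gbps = 9.6    # LPDDR5X data rate, gigabits per second per pin
bus_width_pins = 32    # a common LPDDR channel width (assumption)

bandwidth_gbs = pin_rate_gbps * bus_width_pins / 8  # bits -> bytes
print(f"{bus_width_pins}-bit channel at {pin_rate_gbps} Gb/s/pin "
      f"= {bandwidth_gbs:.1f} GB/s peak")

# Two such channels (64 bits total) would give ~76.8 GB/s of peak bandwidth.
```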

In conclusion, efficient compute architectures and specialized AI models for power-constrained systems are transforming the landscape of AI inference at the edge. By tackling memory bandwidth limitations and performance demands within tight power envelopes, these advances improve performance-per-watt for on-device inference and make AI-enabled edge systems a reality for millions, or even billions, of endpoints.
