Latest Memory Compression

12d

Tether Brings AI Memory Compression To Consumer Devices

Tether’s TurboQuant enables useful and powerful local AI applications on consumer devices at much lower costs and without ...

Context compression finally works in production: new research cuts LLM input 16x without the accuracy hit

LCLMs compress LLM context before decode — 8.8x faster at 16x compression, beating every KV cache method tested. Open-sourced by NYU and Columbia.

4don MSN

How memory tools can make AI models worse

New research suggests that AI memory systems can degrade model performance and encourage sycophantic tendencies.

Morning Overview on MSN

Google unveiled TurboQuant, a method that cuts the memory bottleneck slowing large AI models

Companies running large language models face a persistent bottleneck: the memory consumed by key-value caches during ...

The latest Gemma 4 models use a training trick to slash their on-device memory footprint

You can now download Gemma 4 models with quantization-aware training to reduce the amount of mobile memory required to 1GB.

Nature

Video Compression Algorithms and Memory Efficiency

Video compression has become an essential technology to meet the burgeoning demand for high‐resolution content while maintaining manageable file sizes and transmission speeds. Recent advances in ...

Semiconductor Engineering

Freeing Up Near-Memory Capacity For Cache Using Compression Techniques In A Flat Hybrid-Memory Architecture

A technical paper titled “HMComp: Extending Near-Memory Capacity using Compression in Hybrid Memory” was published by researchers at Chalmers University of Technology and ZeroPoint Technologies.

Electronic Design

Hardware Compression Works at the Memory Cache Level

How lossless data compression can reduce memory and power requirements. How ZeroPoint’s compression technology differs from the competition. One can never have enough memory, and one way to get more ...

Hackaday

compression attached memory

For some computing components, the bottleneck to improved speed and performance hasn’t been power consumption or clock speed but physical space. But a new memory standard may provide all of the power ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results