Nvidia's KV Cache Transform Coding (KVTC) compresses the LLM key-value (KV) cache by 20x without model changes, cutting GPU memory costs and reducing time-to-first-token by up to 8x for multi-turn AI applications.
This approach can be viewed as a memory plug-in for large models, providing a fresh perspective and direction for solving the ...
MIT researchers developed Attention Matching, a KV cache compaction technique that compresses LLM memory by 50x in seconds — ...
This article outlines the design strategies currently used to address these bottlenecks, ranging from data center systolic ...
LLC, positioned between external memory and internal subsystems, stores frequently accessed data close to compute resources.
Morning Overview on MSN: Nanoengineered spintronic memory stores data in 4 resistance states
A magnetic tunnel junction engineered to produce four distinct resistance states instead of the standard two could double the data density of spintronic memory without requiring additional physical ...
VAST Data Federal's Randy Hayes said agencies looking to advance AI should replace fragmented systems with a single data ...
A technical paper titled “HMComp: Extending Near-Memory Capacity using Compression in Hybrid Memory” was published by researchers at Chalmers University of Technology and ZeroPoint Technologies.
At the Huawei Product & Solution Launch during MWC Barcelona 2026, Yuan Yuan, President of Huawei Data Storage Product Line, officially launched Huawei's AI Data Platform. The platform integrates ...
Marvell Technology, Inc. (NASDAQ: MRVL), a leader in data infrastructure semiconductor solutions, today announced Marvell® ...