Models Compression - Search News

Context compression finally works in production: new research cuts LLM input 16x without the accuracy hit

LCLMs compress LLM context before decode: 8.8x faster at 16x compression, beating every KV cache method tested. Open-sourced by NYU and Columbia.

Nature

Context-aware implicit neural representations to compress Earth systems model data

Multiphysics, multiscale climate models, such as the Energy Exascale Earth System Model (E3SM), generate massive volumes of data over extended time periods to support long-term climate analysis. Data ...

Morning Overview on MSN

Google unveiled TurboQuant, a method that cuts the memory bottleneck slowing large AI models

Companies running large language models face a persistent bottleneck: the memory consumed by key-value caches during ...

The Manila Times

Multiverse Computing Launches Pulsar 16B in collaboration with NVIDIA: Frontier-Grade Reasoning at Half the Parameters

The new open reasoning model delivers 30B-class intelligence in a 16B-parameter footprint, with 3.1B active parameters, validated independently on NVIDIA accelerated computing infrastructure.

SiliconANGLE

Report: AI model compression startup Multiverse seeking €500M funding round

Multiverse Computing SL, a startup with technology that reduces the hardware footprint of artificial intelligence models, is reportedly raising new capital. Sources told Bloomberg today the Spanish ...

Ars Technica

Google’s TurboQuant AI-compression algorithm can reduce LLM memory usage by 6x

Even if you don’t know much about the inner workings of generative AI models, you probably know they need a lot of memory. Hence, it is currently almost impossible to buy a measly stick of RAM without ...

Ars Technica

Running local models on Macs gets faster with Ollama’s MLX support

Ollama, a runtime system for operating large language models on a local computer, has introduced support for Apple’s open source MLX framework for machine learning. Additionally, Ollama says it has ...
