Model Compression - Search News

PrismML Introduces The First Commercially Viable 1-Bit LLM

A Caltech Lab at PrismML Just Fit an 8 Billion Parameter AI Model Into 1.15 GB. Announcing a Breakthrough in AI Compression: ...

SiliconANGLE

Multiverse Computing bags $215M for its quantum-inspired AI model compression tech

Multiverse Computing S.L. said today it has raised $215 million in funding to accelerate the deployment of its quantum computing-inspired artificial intelligence model compression technology, which ...

WinBuzzer

Google’s TurboQuant Algorithm Slashes LLM Memory Use by 6x

Google has published TurboQuant, a KV cache compression algorithm that cuts LLM memory usage by 6x with zero accuracy loss, ...

Ars Technica

AI language models can exceed PNG and FLAC in lossless compression, says study

Effective compression is about finding patterns to make data smaller without losing information. When an algorithm or model can accurately guess the next piece of data in a sequence, it shows it’s ...

VentureBeat

Nvidia says it can shrink LLM memory 20x without changing model weights

Nvidia researchers have introduced a new technique that dramatically reduces how much memory large language models need to track conversation history — by as much as 20x — without modifying the model ...

Dark Reading

Intel Discloses Max Severity Bug in Its AI Model Compression Software

Intel has disclosed a maximum severity vulnerability in some versions of its Intel Neural Compressor software for AI model compression. The bug, designated as CVE-2024-22476, provides an ...

DIGITIMES

DeepSeek unveils optical compression model to enhance long text processing for AI

Chinese artificial intelligence startup DeepSeek has introduced DeepSeek-OCR, an open-source model accompanied by a research paper that pioneers a novel "optical compression" method aimed at reducing ...
