MMakeSense 智感

Article

动态

arXiv:2606.07577v1 Announce Type: new Abstract: Audio-visual large language models (LLMs) hold strong promise for long-form video understanding, yet their long-video inference is fundamentally limited by the linear growth of video tokens and key-value (KV) caches. We present OmniMem, a memory-efficient streaming framework designed specifically for audio-visual LLMs. Unlike existing compression methods that treat all tokens uniformly, OmniMem introduces a modality-aware memory allocation strateg

OmniMem: Perturbation-aware Memory Compression for Streaming Audio-Visual LLMs

技术2026-06-140 阅读

arXiv:2606.07577v1 Announce Type: new Abstract: Audio-visual large language models (LLMs) hold strong promise for long-form video understanding, yet their long-video inference is fundamentally limited by the linear growth of video tokens and key-value (KV) caches. We present OmniMem, a memory-efficient streaming framework designed specifically for audio-visual LLMs. Unlike existing compression methods that treat all tokens uniformly, OmniMem introduces a modality-aware memory allocation strateg

arXiv:2606.07577v1 Announce Type: new Abstract: Audio-visual large language models (LLMs) hold strong promise for long-form video understanding, yet their long-video inference is fundamentally limited by the linear growth of video tokens and key-value (KV) caches. We present OmniMem, a memory-efficient streaming framework designed specifically for audio-visual LLMs. Unlike existing compression methods that treat all tokens uniformly, OmniMem introduces a modality-aware memory allocation strateg

用本文提到的模型？

注册即送 1000 万 Token，GPT / Claude / Gemini 一键接入。

评论反馈

相关推荐

技术 · 2026-06-09

Beyond Goodhart's Law: A Dynamic Benchma

arXiv:2606.07805v1 Announce Type: new Abstract: The rapid evolution of Large La

技术 · 2026-06-04

Traj-Evolve

arXiv:2606.02812v1 Announce Type: new Abstract: Modeling patient trajectories f

技术 · 2026-05-30

CAPTCHAs can still detect AI agents

Main site CAPTCHAs can still detect AI agents AI systems now match and exceed hu

技术 · 2026-05-30

MCP is dead?

Articles MCP is dead Chloe Kim Backend Engineer @ Quandri : MCP eats context, ha