Paper Overview · ICLR 2026
Uni-X: Mitigating Modality Conflict in Unified Multimodal Models
Uni-X:缓解统一多模态模型中的模态冲突
A two-end-separated, middle-shared architecture for reducing modality conflict in unified multimodal understanding and generation.
介绍“两端分离,中间共享”的 Uni-X 架构,以及它如何缓解统一多模态理解与生成中的模态冲突。
Unified multimodal models
Modality conflict
GenEval
Read阅读
Paper Overview · NeurIPS 2025 Spotlight
Low-Rank Clone: Efficient Knowledge Distillation for LLMs
Low-Rank Clone:高效知识蒸馏
An overview of LRC, which uses low-rank projection and activation clone to make small language model training far more token-efficient.
概览 LRC 如何通过低秩投影和激活克隆,让小语言模型训练具备更高 token 效率。
Knowledge distillation
Small language models
Low-rank clone
Read阅读
Paper Overview · ICLR 2025
OmniKV: Dynamic Context Selection for Efficient Long-Context LLMs
OmniKV:面向高效长上下文 LLM 的动态上下文选择
A method overview for token-dropping-free long-context inference with dynamic KV-cache selection and offloading.
介绍 OmniKV 如何通过动态 KV-cache 选择与 offloading,实现不永久丢 token 的长上下文高效推理。
Long-context LLMs
KV cache
Efficient inference
Read阅读