From Bits to Tokens: Knowledge-Driven
Generative Communication of Multimodal Data

NSDI 2026
1University of California, San Diego 2University of Southern California 3Chinese University of Hong Kong 4University of Michigan, Ann Arbor
KDC system workflow: task-aware transmitter, codec wrapper, and knowledge-conditioned receiver

KDC transmits semantic knowledge rather than raw bits. The task-aware transmitter prioritizes semantically critical content, while the knowledge-conditioned receiver restores corrupted data through reasoning over shared and accumulated knowledge bases.

Abstract

Classical communication systems strive for bit-by-bit reconstruction, yet this objective often misaligns with downstream application tasks, such as perception and decision-making with sensor data. This mismatch is amplified in wireless settings, where packet losses and channel dynamics make strict bit-fidelity both costly and fragile. In this paper, we introduce Knowledge-Driven Communication (KDC), a framework that transmits semantic knowledge rather than raw bits by leveraging pretrained knowledge bases. KDC features a task-aware transmitter, which uses multimodal foundation models to abstract source data into tokenized embeddings and prioritize semantically critical content, enabling zero-shot adaptation without task-specific retraining. On the receiver side, KDC employs pretrained knowledge bases and incrementally updated context to reconstruct task-relevant information, enabling graceful degradation even under data loss. We implement a full KDC prototype and evaluate it over diverse data modalities and wireless networks. KDC operates as an application-layer codec wrapper and receiver-side restoration module, fully compatible with existing wireless communication protocols and source/channel coding mechanisms. Experiments show that KDC consistently outperforms state-of-the-art codecs and learned baselines, achieving high task accuracy with a fraction of the transmitted data, while maintaining robustness under challenging wireless conditions.

System Overview

KDC abstracts heterogeneous sensor data into a unified token space

Unified multimodal tokenization. KDC abstracts heterogeneous sensor data (images, video, 3D volumes, point clouds) into a unified token space via existing multimodal foundation models, then scores each token's semantic importance for task-aware prioritization.

BibTeX

@inproceedings{chen2026kdc, title = {From Bits to Tokens: Knowledge-Driven Generative Communication of Multimodal Data}, author = {Chen, Xingyu and Feng, Zihao and Zhao, Wuqiong and Ding, Jianrong and Sun, Ke and Zhang, Xinyu}, booktitle = {Proceedings of the 23rd USENIX Symposium on Networked Systems Design and Implementation (NSDI)}, year = {2026}, }