Dasheng | Lab Index

A family of audio encoders and audio-language models. The original Dasheng encoder (1.2B parameters, 272K hours of training audio) evolved into DashengLM and MiDashengLM-7B, which target efficient audio understanding across speech, music, and acoustics.

Paper (arXiv, Dasheng) | Paper (arXiv, DashengLM) | GitHub

Outputs 3

Dasheng: Scaling Masked Audio Encoder Learning

paper

The original Dasheng audio encoder (1.2B parameters, 272K hours of training audio). It is the foundation for MiDashengLM and DashengTokenizer.

Paper (arXiv)

arXiv HTML

Dasheng-LM: Efficient Audio Understanding with General Audio Captions

paper 2025-08-06

Research on efficient audio understanding using general audio captions.

Paper (arXiv) | GitHub

arXiv HTML

MiDashengLM-7B

model 2025-08-06

Efficient audio understanding model built on the "Dasheng" encoder, supporting speech, music, and acoustics.

GitHub

Architecture DENSE

Parameters 7B

audio | architecture | open-weight
