Dasheng
paper modelAudio encoder and language model family. Original Dasheng encoder (1.2B params, 272K hours) evolved into DashengLM and MiDashengLM-7B for efficient audio understanding supporting speech, music, and acoustics.
Outputs 3
Dasheng: Scaling Masked Audio Encoder Learning
paperOriginal Dasheng audio encoder (1.2B params, 272K hours). Foundation for MiDashengLM and DashengTokenizer.
arXiv: 2406.06992
Dasheng-LM: Efficient Audio Understanding with General Audio Captions
paperResearch on efficient audio understanding using general audio captions.
arXiv: 2508.03983
MiDashengLM-7B
modelEfficient audio understanding model built on the "Dasheng" encoder, supporting speech, music, and acoustics.
Architecture DENSE
Parameters 7B