SciRIFF
Dataset
137K instruction-following demonstrations across 54 scientific literature tasks (extraction, summarization, QA, claim verification, classification). SciTulu models improve over baselines by 28.1% at 7B and 6.5% at 70B on scientific tasks while maintaining general instruction-following performance.
Paper
arXiv: 2406.07835