Submitted by akhaliq 35 INDUS: Effective and Efficient Language Models for Scientific Applications · 34 authors 1
Submitted by akhaliq 23 Layer-Condensed KV Cache for Efficient Inference of Large Language Models · 2 authors 1
Submitted by akhaliq 14 Observational Scaling Laws and the Predictability of Language Model Performance · 3 authors 1
Submitted by akhaliq 8 Dynamic data sampler for cross-language transfer learning in large language models · 7 authors