CoSER: Coordinating LLM-Based Persona Simulation of Established Roles Paper • 2502.09082 • Published 29 days ago • 28
BenchMAX: A Comprehensive Multilingual Evaluation Suite for Large Language Models Paper • 2502.07346 • Published Feb 11 • 51
UI Agent Collection a collection of algorithmic agents for user interfaces/interactions, program synthesis, and robots • 315 items • Updated about 8 hours ago • 47