Submitted by akhaliq 64 AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning · 7 authors 8
Submitted by akhaliq 22 Semantic-SAM: Segment and Recognize Anything at Any Granularity · 9 authors 1
Submitted by akhaliq 19 Unleashing Cognitive Synergy in Large Language Models: A Task-Solving Agent through Multi-Persona Self-Collaboration · 6 authors
Submitted by akhaliq 7 Empowering Cross-lingual Behavioral Testing of NLP Models with Typological Features · 2 authors
Submitted by akhaliq 7 On decoder-only architecture for speech-to-text and large language model integration · 11 authors
Submitted by akhaliq 4 Shelving, Stacking, Hanging: Relational Pose Diffusion for Multi-modal Rearrangement · 8 authors
Submitted by akhaliq 2 AnyTeleop: A General Vision-Based Dexterous Robot Arm-Hand Teleoperation System · 8 authors