HF中国镜像站

William Berrios's picture

6 7 38

William Berrios

will33am

·

https://williamberrios.github.io/

AI & ML interests

multimodal-learning, representation learning, generative modeling, robustness.

Recent Activity

upvoted a paper 2 days ago

Robusto-1 Dataset: Comparing Humans and VLMs on real out-of-distribution Autonomous Driving VQA from Peru

updated a dataset 3 months ago

ContextualAI/LFQA_eval_dataset_unit_tests_contrastive

updated a dataset 3 months ago

ContextualAI/LFQA_eval_dataset_unit_tests_justification

View all activity

Organizations

will33am's activity

upvoted a paper 2 days ago

Robusto-1 Dataset: Comparing Humans and VLMs on real out-of-distribution Autonomous Driving VQA from Peru

Paper • 2503.07587 • Published 4 days ago • 10

updated 3 datasets 3 months ago

ContextualAI/LFQA_eval_dataset_unit_tests_contrastive

Viewer • Updated Dec 13, 2024 • 260 • 54

ContextualAI/LFQA_eval_dataset_unit_tests_justification

Viewer • Updated Dec 13, 2024 • 260 • 58

ContextualAI/LFQA_eval_dataset_unit_tests_with_justification

Viewer • Updated Dec 13, 2024 • 5 • 66

liked a model 3 months ago

yujiepan/llama-3.3-tiny-random

Text Generation • Updated Dec 6, 2024 • 504 • 1

commented a paper 4 months ago

Multimodal Autoregressive Pre-training of Large Vision Encoders

Paper • 2411.14402 • Published Nov 21, 2024 • 43 •

upvoted a paper 4 months ago

Multimodal Autoregressive Pre-training of Large Vision Encoders

Paper • 2411.14402 • Published Nov 21, 2024 • 43

New activity in NinaCalvi/infobench_expert_split 4 months ago

Question how to generate the dataset

#2 opened 4 months ago by

liked a dataset 4 months ago

NinaCalvi/infobench_expert_split

Viewer • Updated Oct 9, 2024 • 927 • 79 • 1

updated a dataset 4 months ago

Self-GRIT/frames-benchmark

Viewer • Updated Nov 2, 2024 • 824 • 54

liked a dataset 4 months ago

parasail-ai/frames-benchmark-wikipedia

Viewer • Updated Dec 2, 2024 • 2.52k • 173 • 3

updated 9 datasets 5 months ago

Self-GRIT/frames_random_subsample_eval

Viewer • Updated Oct 25, 2024 • 400 • 51

Self-GRIT/frames_eval

Viewer • Updated Oct 25, 2024 • 824 • 51

Self-GRIT/frames-benchmark-chunksize-1024-chunkoverlap-20

Viewer • Updated Oct 25, 2024 • 824 • 47

Self-GRIT/contextual_bench_hotpotqa_random_subsample_eval

Viewer • Updated Oct 11, 2024 • 400 • 46

Self-GRIT/contextual_bench_hotpotqa_eval

Viewer • Updated Oct 11, 2024 • 7.41k • 73

Self-GRIT/alpaca_eval_random_subsample_eval

Viewer • Updated Oct 9, 2024 • 50 • 59

Self-GRIT/triviaqa_unfiltered_unhidden_random_subsample_eval

Viewer • Updated Oct 9, 2024 • 400 • 50

Self-GRIT/triviaqa_unfiltered_unhidden_eval

Viewer • Updated Oct 9, 2024 • 11.3k • 57

Self-GRIT/popqa_long_tail_random_subsample_eval

Viewer • Updated Oct 8, 2024 • 400 • 57