Sourab Mangrulkar

smangrul

AI & ML interests

Machine Learning, Deep Learning, Natural Language Processing, Natural Language Generation, Computer Vision, Reinforcement Learning

Recent Activity

liked a model 1 day ago
Qwen/Qwen2.5-1.5B-Instruct
liked a Space 15 days ago
huggingface/ai-deadlines
liked a model 2 months ago
mistralai/Mistral-7B-Instruct-v0.3
View all activity

Organizations

Speech Recognition Community Event Version 2's profile picture BigScience Data's profile picture group2's profile picture BigCode's profile picture Diffusers Pipelines Library for Stable Diffusion's profile picture Social Post Explorers's profile picture

smangrul's activity

published an article 12 months ago
view article
Article

GaLore: Advancing Large Model Training on Consumer-grade Hardware

27
published an article about 1 year ago
view article
Article

🤗 PEFT welcomes new merging methods

16
published an article over 1 year ago
view article
Article

Mixture of Experts Explained

449
published an article over 1 year ago
view article
Article

Personal Copilot: Train Your Own Coding Assistant

46
published an article over 1 year ago
view article
Article

Fine-tuning Llama 2 70B using PyTorch FSDP

21
published an article almost 2 years ago
view article
Article

The Falcon has landed in the HF中国镜像站 ecosystem

12
published an article almost 2 years ago
view article
Article

Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA

126
published an article about 2 years ago
view article
Article

Fine-tuning 20B LLMs with RLHF on a 24GB consumer GPU

43
published an article about 2 years ago
view article
Article

🤗 PEFT: Parameter-Efficient Fine-Tuning of Billion-Scale Models on Low-Resource Hardware

61
published an article over 2 years ago
view article
Article

Accelerate Large Model Training using DeepSpeed

3
published an article almost 3 years ago
view article
Article

Accelerate Large Model Training using PyTorch Fully Sharded Data Parallel

3