Rakesh Tanwar
rakeshtanwar
AI & ML interests
None yet
Recent Activity
liked a model about 10 hours ago
prithivMLmods/Qwen3.5-35B-A3B-abliterated-v2-MAX reacted to pankajpandey-dev's post with ๐ฅ about 10 hours ago
๐ฎ๐ณ Qwen3-4B Hindi Instruct v2 โ a Hindi LLM that runs on your own machine
Most strong Hindi-capable models are either huge or cloud-only. I wanted one that's small enough to run locally but actually follows instructions in Hindi โ so I fine-tuned Qwen3-4B on 10K Hindi instruction pairs and shipped it with a full GGUF quant ladder.
โ
Fine-tune (16-bit): huggingface.co/pankajpandey-dev/Qwen3-4B-Hindi-Instruct-v2
โ
GGUF (Q4/Q5/Q8): huggingface.co/pankajpandey-dev/Qwen3-4B-Hindi-Instruct-v2-GGUF
Runs in Ollama, llama.cpp, and LM Studio. The Q4_K_M is just 2.5 GB โ fits comfortably on a laptop, CPU or GPU.
Part of my Hindi LLM Series โ building openly-licensed Indic models for local and edge use. More coming (Gemma next). Feedback welcome ๐
#Hindi #IndicNLP #GGUF #LocalLLM #Qwen
upvoted an article about 10 hours ago
Profiling in PyTorch (Part 1): A Beginner's Guide to torch.profiler