Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
embedl 's Collections
FlashHead
EdgeN
Cosmos-Reason2
Qwen3.5
NVIDIA Jetson Orin Nano
NVIDIA Jetson AGX Orin
NVIDIA Jetson AGX Thor

EdgeN

updated 18 days ago

Quantization strategy where most weights are converted to INT4, activations remain in FP16, and sensitive layers are preserved in FP16.

Upvote
1

  • embedl/Cosmos-Reason2-2B-W4A16-Edge2

    Image-Text-to-Text • 2B • Updated 12 days ago • 994 • 12

  • embedl/Cosmos-Reason2-2B-W4A16-Edge2-FlashHead

    Image-Text-to-Text • 2B • Updated 12 days ago • 1.87k • 9

  • Running
    6

    Edge Inference Benchmarks

    🚀
    6

    On-Device benchmarks across devices and models.

Upvote
1
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs