Instructions to use FahrenheitResearch/FR-Forge-1.7B with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use FahrenheitResearch/FR-Forge-1.7B with MLX:

# Make sure mlx-lm is installed
# pip install --upgrade mlx-lm

# Generate text with mlx-lm
from mlx_lm import load, generate

model, tokenizer = load("FahrenheitResearch/FR-Forge-1.7B")

prompt = "Write a story about Einstein"
messages = [{"role": "user", "content": prompt}]
prompt = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True
)

text = generate(model, tokenizer, prompt=prompt, verbose=True)

Notebooks
Google Colab
Kaggle
Local Apps Settings
LM Studio

How to use FahrenheitResearch/FR-Forge-1.7B with Pi:

Start the MLX server

# Install MLX LM:
uv tool install mlx-lm
# Start a local OpenAI-compatible server:
mlx_lm.server --model "FahrenheitResearch/FR-Forge-1.7B"

Configure the model in Pi

# Install Pi:
npm install -g @mariozechner/pi-coding-agent
# Add to ~/.pi/agent/models.json:
{
  "providers": {
    "mlx-lm": {
      "baseUrl": "http://localhost:8080/v1",
      "api": "openai-completions",
      "apiKey": "none",
      "models": [
        {
          "id": "FahrenheitResearch/FR-Forge-1.7B"
        }
      ]
    }
  }
}

Run Pi

# Start Pi in your project directory:
pi

Hermes Agent new

How to use FahrenheitResearch/FR-Forge-1.7B with Hermes Agent:

Start the MLX server

# Install MLX LM:
uv tool install mlx-lm
# Start a local OpenAI-compatible server:
mlx_lm.server --model "FahrenheitResearch/FR-Forge-1.7B"

Configure Hermes

# Install Hermes:
curl -fsSL https://hermes-agent.nousresearch.com/install.sh | bash
hermes setup
# Point Hermes at the local server:
hermes config set model.provider custom
hermes config set model.base_url http://127.0.0.1:8080/v1
hermes config set model.default FahrenheitResearch/FR-Forge-1.7B

Run Hermes

hermes

MLX LM

How to use FahrenheitResearch/FR-Forge-1.7B with MLX LM:

Generate or start a chat session

# Install MLX LM
uv tool install mlx-lm
# Interactive chat REPL
mlx_lm.chat --model "FahrenheitResearch/FR-Forge-1.7B"

Run an OpenAI-compatible server

# Install MLX LM
uv tool install mlx-lm
# Start the server
mlx_lm.server --model "FahrenheitResearch/FR-Forge-1.7B"
# Calling the OpenAI-compatible server with curl
curl -X POST "http://localhost:8000/v1/chat/completions" \
   -H "Content-Type: application/json" \
   --data '{
     "model": "FahrenheitResearch/FR-Forge-1.7B",
     "messages": [
       {"role": "user", "content": "Hello"}
     ]
   }'

FR-Forge-1.7B

A Thin Language Model for manufacturing. Grounded, specialized, and running on your laptop.

GROUNDED. SPECIALIZED. LOCAL.

Overview
How it works
Specifications
Quickstart
Intended use
Evaluation
Limitations
Training
Citation

Overview

FR-Forge-1.7B is a Thin Language Model (TLM) for the manufacturing sector, by Fahrenheit Research. Sibling to FR-Lex (legal). It is a small, 4-bit, MLX model fine-tuned with LoRA adapters on a curated manufacturing instruction corpus, designed to run locally on Apple Silicon.

  Where a Thin Language Model fits

  specialization
     ▲
     │   ● FR-Forge 1.7B
     │     narrow · local · cheap
     │
     │                              ● Frontier LLM
     │                                broad · hosted · costly
     └──────────────────────────────────────────────▶ generality

How it works

  Manufacturing question
            │
            ▼
  ┌─────────────────────────────────────────────────────┐
  │ FR-Forge 1.7B                                         │
  └─────────────────────────────────────────────────────┘
            │
   reasons across three pillars
            │
            ├─ Ops & Maintenance        SOPs · PM/PdM · OEE · troubleshooting
            ├─ Quality & Compliance     ISO 9001 · IATF 16949 · FDA/GMP · CAPA · FMEA · SPC
            └─ Supply Chain & Planning  MRP/ERP · BOMs · inventory · demand planning
            │
            ▼
   Grounded, local answer  (verify exact clauses against the controlling standard)

It is an assistant, not a certified authority. It is not a substitute for the controlling standard, safety sign-off, or regulatory advice.

Specifications


Base	Qwen3-1.7B (4-bit, MLX)
Parameters	1.7B · 4-bit
Method	LoRA adapters, fused
Runtime	MLX (Apple Silicon)
Languages	English
License	Apache-2.0

Quickstart

Runs locally with MLX on Apple Silicon. Apply a repetition penalty to avoid looping.

pip install -U mlx-lm
python3 -m mlx_lm generate \
  --model FahrenheitResearch/FR-Forge-1.7B \
  --repetition-penalty 1.15 \
  --max-tokens 500 \
  --prompt "How should I set safety stock for a long-lead component with variable demand?"

Recommended generation settings: repetition_penalty = 1.15 (raise to 1.3 if output repeats). A system prompt of "You are FR-Forge, a manufacturing domain assistant." matches training.

Intended use

Assisting manufacturing teams with everyday domain questions across three pillars:

  FR-Forge covers
  ├─ Shop-floor ops & maintenance   SOPs, work instructions, equipment manuals, PM/PdM, OEE, troubleshooting
  ├─ Quality & compliance           ISO 9001, IATF 16949, ISO 13485, FDA/GMP, CAPA, FMEA, SPC, MSA, audits
  └─ Supply chain & planning         MRP/ERP, procurement, BOMs, inventory, demand planning, suppliers

Out of scope. Safety, regulatory, or compliance sign-off; anything that must be exact (part numbers, clause text, customer-specific requirements) without grounding it in your own documents.

Evaluation

A held-out set of prompts across the three pillars is scored with a deterministic keyword-rubric: each item defines groups of required terms, and a group passes if any synonym appears in the answer. The item score is the fraction of groups covered; pillar and overall scores are averages. Generation uses a 1.15 repetition penalty.

  Held-out eval, keyword rubric (percent coverage)

  Ops & Maintenance        ███████████████████░  92.9
  Quality & Compliance     ██████████████░░░░░░  70.8
  Supply Chain & Planning  █████████████████░░░  83.3
  ───────────────────────  ────────────────────  ────
  Overall                  ████████████████░░░░  82.3

Pillar	Score
Ops & Maintenance	92.9%
Quality & Compliance	70.8%
Supply Chain & Planning	83.3%
Overall	82.3%

This measures domain-term coverage, not eloquence or factual grading, and the held-out set is small, so treat results as a directional, reproducible yardstick rather than a precise grade. The untuned base can be scored with python3 scripts/evaluate.py --base-only, and the tuned model is reproduced with python3 scripts/evaluate.py --max-tokens 500.

Limitations

Not a certified authority. Outputs assist research and drafting only. They are not a substitute for the controlling standard, safety sign-off, or regulatory advice.

English only (v1).
Paraphrased, not verbatim. Trained on domain reasoning, not reproduced standards; always verify clause-level detail against the controlling standard.
Small model. For facts that must be exact (part numbers, clause text, customer-specific requirements), ground it with retrieval over your own documents rather than relying on memory.
Self-reported metric. Evaluation is an internal keyword-coverage score on a small held-out set.

Training

  Domain instruction pairs ─┐
  110 ops & maintenance      │
  100 quality & compliance   ├─▶ LoRA fine-tune (MLX) ─▶ fuse adapters ─▶ FR-Forge 1.7B
   51 supply chain & planning┘    iters 800 · lr 1e-5 · seq 512

  Training data, 261 instruction pairs
  Ops & Maintenance        ████████████████  110
  Quality & Compliance     ███████████████░  100
  Supply Chain & Planning  ███████░░░░░░░░░   51

Base, method, data, and hyperparameters

Base model: mlx-community/Qwen3-1.7B-4bit (Qwen3 architecture, 4-bit)
Method: LoRA adapters via mlx-lm, fused into this standalone model
Data: 261 curated instruction pairs (110 ops & maintenance, 100 quality & compliance, 51 supply chain & planning). Sources: paraphrased domain reasoning plus model-assisted synthetic pairs with human review. No copyrighted standard text is reproduced verbatim; the data teaches structure and reasoning and cites clause numbers.
Hyperparameters: iters 800, LoRA layers 8, batch size 1, max sequence length 512, learning rate 1e-5, gradient checkpointing on. Peak training memory ~2 GB.

Citation

Apache-2.0. Base model mlx-community/Qwen3-1.7B-4bit is Apache-2.0.

@software{fr_forge_2026,
  title  = {FR-Forge-1.7B: a thin language model for manufacturing},
  author = {Fahrenheit Research},
  year   = {2026},
  note   = {Fine-tuned from Qwen3-1.7B (4-bit) with MLX/LoRA}
}

FAHRENHEIT RESEARCH

Thin Language Models for specialized domains.

Website · GitHub · Sibling model: FR-Lex-1.7B

Downloads last month: 40

Safetensors

Model size

0.3B params

Tensor type

BF16

U32

MLX

Hardware compatibility

4-bit

Model tree for FahrenheitResearch/FR-Forge-1.7B

Base model

Qwen/Qwen3-1.7B-Base

Finetuned

Qwen/Qwen3-1.7B

Quantized

mlx-community/Qwen3-1.7B-4bit