relm-1

relm-1 is a specialized post-train of Gemma 4 (31B-it) designed to excel in complex reasoning, high-difficulty SQL generation, telco software, and specialized coding tasks.

Model Summary

Base Model: google/gemma-4-31B-it
Training Method: Post-trained via LoRA (Low-Rank Adaptation) using the scalarlm framework, with weights merged back into the base model for standalone deployment.
Specializations: Complex SQL, OpenTelco (OTel), Relational AI PyRel, and General Reasoning.
Context Window (SFT): 6,144 tokens

🎯 Intended Use

relm-1 is designed for developers and data engineers who need a model capable of:

Advanced SQL Generation: Solving high-complexity database queries, including specialized Snowflake SQL tasks and benchmarks based on Spider-2.
Relational AI PyRel: Generating and reasoning over PyRel, a Pythonic relational language that translates high-level logic into optimized SQL.
Observability Analysis: Understanding and generating telco prompts and completions (OTel-LLM).
Reasoning-Heavy Tasks: Leveraging Chain-of-Thought (CoT) to decompose complex software engineering problems.

🛠️ Training Details

Dataset Composition

The model underwent post-training on a balanced mixture of high-quality SFT data:

Dataset	Weight	Focus
Relational AI & PyRel	50%	PyRel and relational logic
Spider-2	20%	Complex SQL generation
Nemotron-Cascade-2-SFT	10%	General chat and instruction following
OTel-LLM	10%	OpenTelco
Specialized Snowflake SQL	10%	Advanced Snowflake SQL tasks

Training Hyperparameters

Learning Rate: 2e-4
Max Steps: 6,000
Gradient Accumulation: 2
Architecture: LoRA adaptation merged into the 31B base model.

✍️ Prompting & Usage

Reasoning Format

relm-1 utilizes Gemma 4's native channel-thought format for reasoning. To trigger Chain-of-Thought, the model is trained to follow this structure:

<thought>
{Step-by-step reasoning and analysis}
</thought>
{Final answer}

Specialized Directives

The model responds to specific formatting directives injected during post-training:
- FINOS Legend: For tasks requiring the "Pure" version of FINOS Legend, the model is optimized for the prompt: "Answer using FINOS Legend (Pure language)."
- Hybrid SQL/PyRel: For tasks requiring dual implementations, the model is optimized for: "Provide both a SQL solution and a PyRel solution."

Chat Template

The model follows the official Gemma 4 template:

<|user|>
{prompt}<|end_of_turn|>
<|model|>
{response}<|end_of_turn|>

⚠️  Limitations

- Context Window: Post-training samples were filtered to a maximum of 6,144 tokens; performance on extremely long documents may vary.
- Base Model Dependency: While the weights are merged, the model's foundational capabilities are inherited from the Gemma 4 31B-it base.

Downloads last month: 15

Safetensors

Model size

31B params

Tensor type

BF16

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for gdiamos/relm-1

Base model

google/gemma-4-31B

Finetuned

google/gemma-4-31B-it

Finetuned

(190)

this model