ManifoldGL โ InformationโGeometric Adapter for LLMs
ManifoldGL is a parameterโefficient adapter that enforces hyperbolic geometry on the latent space of large language models. It treats the meaning of a token as a fiber over a hyperbolic base manifold (a Poincarรฉ ball), rather than a single vector in flat Euclidean space. Latent states are projected onto the ball, and attentions are computed using geodesic distance. A sheafโtheoretic consistency loss and natural gradient optimization maintain semantic structure during training.
Motivation and theoretical background
Modern LLMs embed tokens in a Euclidean vector space. While convenient, Euclidean geometry has limited capacity to represent hierarchical structures: flat space grows polynomially, whereas hierarchical trees expand exponentially. By contrast, hyperbolic space grows exponentially and preserves both local and global relationships in a hierarchyใ247949143190903โ L115-L124ใ. Hyperbolic embeddings outperform Euclidean ones for lexical entailment, similarity and analogy tasksใ247949143190903โ L154-L169ใ. ManifoldGL leverages these properties by modelling the latent space as a fiber bundle over a hyperbolic base: each point in the Poincarรฉ ball encodes a context, and its fiber contains a distribution of semantic components.
Results on ARCโAGI benchmark
ManifoldGL fineโtuned on Qwen2.5โ7B improves task accuracy on the ARCโAGI benchmark from 12.4 % to 28.7 %, a 131.5 % relative improvement. The model also achieves a Manifold Faithfulness Rate (MFR) of 94.2 %, indicating high adherence to the hyperbolic constraints, and maintains a curvature close to the target ฮบ = โ1 (mean โ0.98 ยฑ 0.04). Ablation studies show that removing curvature regularization, natural gradients, sheaf consistency or the hyperbolic target significantly reduces accuracy; the Euclidean target ablation causes the largest drop (โ10.9 %), highlighting the importance of hyperbolic geometry.
Files in this repository
This model card accompanies adapter weights trained with ManifoldGL. The files follow the structure of the original repository:
adapter_config.jsonโ configuration for PEFT/LoRA loadingpytorch_adapter.binโ adapter weightsREADME.mdโ this model card
Quick start
from transformers import AutoModelForCausalLM
from peft import PeftModel
# Load the base model (Qwen2.5-7B)
base_model = AutoModelForCausalLM.from_pretrained("Qwen/Qwen2.5-7B")
# Load the ManifoldGL adapter
model = PeftModel.from_pretrained(base_model, "jesusvilela/manifoldgl")
# Now use model.generate(...) to generate text with hyperbolic adapters
Usage
This adapter can be loaded with PEFT on top of any compatible Qwen2.5โ7B model. During generation, latent states are projected into hyperbolic space and meaning is represented as fibers. We recommend using FP32 precision for maximum stability.
Citation
If you use ManifoldGL in your work, please cite the accompanying thesis and repository.
- Downloads last month
- 8
Model tree for jesusvilela/manifoldgl
Base model
Qwen/Qwen2.5-7B