We are pleased to announce our new foundation model family, which includes Pharia-1-LLM-7B-control and Pharia-1-LLM-7B-control-aligned, now publicly available under the Open Aleph License, which explicitly allows non-commercial research and educational use. Pharia-1-LLM-7B-control is engineered to deliver concise, length-controlled responses that match the performance of leading open-source models in the 7B to 8B parameter range and, thanks to training on a multilingual base corpus, is culturally and linguistically optimized for German, French, and Spanish. Pharia-1-LLM-7B-control is trained on carefully curated data in compliance with applicable EU and national regulations, including copyright and data privacy laws. With improved token efficiency, Pharia-1-LLM-7B-control excels in domain-specific applications, particularly in the automotive and engineering industries, and can be aligned to user preferences, making it suitable for critical applications without the risk of shut-down behavior. As such, it serves as a valuable addition to the community’s selection of weight-available foundation models. Pharia-1-LLM-7B-control-aligned adds further safety guardrails via alignment methods.
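Since the weights are published on Hugging Face, a minimal usage sketch follows. It assumes the checkpoint loads through the standard transformers Auto classes and that trust_remote_code=True is needed for the model's custom architecture code; verify against the model card before relying on it.

```python
# Minimal sketch: loading and prompting Pharia-1-LLM-7B-control.
# Assumptions (not confirmed here): the checkpoint works with AutoModelForCausalLM,
# and trust_remote_code=True is required for the custom Pharia architecture.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "Aleph-Alpha/Pharia-1-LLM-7B-control"

tokenizer = AutoTokenizer.from_pretrained(model_name, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    torch_dtype="auto",   # keep the checkpoint's native precision
    device_map="auto",    # place weights on available GPU(s) via accelerate
    trust_remote_code=True,
)

prompt = "Explain in two sentences what a foundation model is."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```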
Notes: Training compute as reported by the authors: 2.75×10^23 + 1.68×10^23 = 4.43×10^23 FLOP. Source: https://huggingface.co/Aleph-Alpha/Pharia-1-LLM-7B-control#compute--training-efficiency
Size Notes: 4.7T + 3T = 7.7T tokens
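A quick sanity check of the two totals above; the component figures are taken directly from the notes (the authors' reporting), not re-derived here.

```python
# Verify the reported totals for training compute and dataset size.
flop_components = [2.75e23, 1.68e23]    # reported FLOP components
token_components = [4.7e12, 3.0e12]     # reported token counts

total_flop = sum(flop_components)
total_tokens = sum(token_components)

print(f"Total training compute: {total_flop:.2e} FLOP")   # 4.43e+23
print(f"Total training tokens:  {total_tokens:.1e}")      # 7.7e+12
```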