AI case study • LLM fine tuning

LLora: QLoRA fine tuning of a small, locally runnable LLM for AI Act and compliance use cases in business.

LLora documents a complete domain adaptation pipeline: curated JSONL dataset, QLoRA training, intermediate checkpoints, final LoRA adapter and metric interpretation.

50

JSONL instruction/output examples

Step 40

best checkpoint identified

52.6%

training loss reduction

Base model: unsloth/Qwen2.5-1.5B-Instruct-bnb-4bit

Method: QLoRA

LoRA rank: 16

Eval loss: -31.2%

Dataset

50 examples on AI Act, compliance and operational support for SMEs.

JSONL

Training

4-bit setup, effective batch size 4, 5 epochs, linear scheduler.

5 epochs

Output

Exported LoRA adapter and readable intermediate checkpoints.

PEFT

Overview

A documented technical experiment, not a generic AI demo.

LLora was built to specialize a compact model on regulatory and operational content related to the European AI Act. The goal was not a general-purpose model, but a realistic domain adaptation workflow.

The value is traceability: dataset, configuration, metrics, checkpoints and final adapter are presented as parts of a verifiable ML system.

Clear domain

AI Act, compliance and internal procedures for Italian SMEs, with outputs designed for operational support.

Lightweight setup

Qwen2.5 1.5B base model in 4-bit, separate LoRA adapter and a workflow suitable for local reuse.

Challenge

Turning a complex regulatory topic into a training-ready dataset.

The challenge was not only training a model. It was defining a clean information boundary, with coherent examples, consistent responses and a credible specialization goal.

small but curated dataset
unique instructions and coherent outputs
regulatory topic with risk of generic answers
need to read metrics and checkpoints correctly

Solution

A complete, measured and presentable QLoRA pipeline.

The solution applies QLoRA to a 4-bit quantized model, using a lightweight rank 16 adapter and saving intermediate checkpoints. The best checkpoint is identified at step 40, which avoids the mistake of treating the final step as the best model.

instruction tuning on question/answer pairs
LoRA applied to attention and MLP modules
training loss from 2.395 to 1.136
minimum eval loss 1.500 at step 40
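
The figures above can be cross-checked with a few lines of arithmetic. This is a minimal sketch, assuming 40 training examples (the training side of the 40/10 split), an effective batch size of 4 with no dropped partial batch, and that "step" means one optimizer step; the helper name `percent_reduction` is illustrative and not part of the project.

```python
import math


def percent_reduction(start, end):
    """Relative drop from start to end, expressed as a percentage."""
    return (start - end) / start * 100


# Training loss values reported in the case study.
train_reduction = percent_reduction(2.395, 1.136)

# Step arithmetic under the assumptions stated above.
train_examples = 40
effective_batch = 4
epochs = 5

steps_per_epoch = math.ceil(train_examples / effective_batch)
total_steps = steps_per_epoch * epochs

# Epoch at which the best checkpoint (step 40) was saved.
best_step_epoch = 40 / steps_per_epoch

print(f"training loss reduction: {train_reduction:.1f}%")
print(f"total optimizer steps:   {total_steps}")
print(f"best checkpoint epoch:   {best_step_epoch:.0f}")
```

Under these assumptions the run lasts 50 steps, so the step 40 checkpoint corresponds to the end of the fourth epoch, one epoch before the final step.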

Qwen2.5 1.5B 4-bit

Compact, instruction-tuned, quantized model selected for efficient experimentation and local reuse.

50 JSONL examples

Instruction/output records with a 40/10 train/evaluation split, responses averaging close to 100 words and a consistent structure.
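
A minimal sketch of how such a file could be read, split and sanity-checked. It assumes each line is a JSON object with `instruction` and `output` keys (field names inferred from the description, not confirmed by the project); the file name `dataset.jsonl`, the seed and the helper names are hypothetical.

```python
import json
import random


def load_jsonl(path):
    """Read one JSON object per non-empty line."""
    with open(path, encoding="utf-8") as f:
        return [json.loads(line) for line in f if line.strip()]


def split_records(records, n_eval=10, seed=42):
    """Shuffle a copy of the records and hold out n_eval of them for evaluation."""
    shuffled = list(records)
    random.Random(seed).shuffle(shuffled)
    return shuffled[n_eval:], shuffled[:n_eval]


def mean_output_words(records):
    """Average whitespace-separated word count of the 'output' field."""
    return sum(len(r["output"].split()) for r in records) / len(records)


def has_unique_instructions(records):
    """True when no instruction text appears more than once."""
    instructions = [r["instruction"] for r in records]
    return len(instructions) == len(set(instructions))
```

With a 50-record file, `train, eval_set = split_records(load_jsonl("dataset.jsonl"))` would give 40 training and 10 evaluation records, and `mean_output_words` offers a quick check on the "close to 100 words" target.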

Rank 16 adapter

Efficient fine tuning without fully retraining the base model, with LoRA alpha 16 and dropout 0.
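
The adapter settings can be written down as a small configuration. This is a sketch, not the project's actual training script: rank, alpha and dropout come from the case study, while the list of target module names is an assumption based on the usual projection names in Qwen2-style decoder blocks (the text only says "attention and MLP modules"). The `peft` import is optional here, so the snippet still runs where the library is not installed.

```python
# Values stated in the case study: rank 16, alpha 16, dropout 0.
# The module names are an assumption, not a list quoted from the project.
lora_kwargs = {
    "r": 16,
    "lora_alpha": 16,
    "lora_dropout": 0.0,
    "target_modules": [
        "q_proj", "k_proj", "v_proj", "o_proj",  # attention projections
        "gate_proj", "up_proj", "down_proj",     # MLP projections
    ],
    "task_type": "CAUSAL_LM",
}

# Standard LoRA scales the low-rank update by alpha / r.
scaling = lora_kwargs["lora_alpha"] / lora_kwargs["r"]

try:
    from peft import LoraConfig  # optional dependency

    lora_config = LoraConfig(**lora_kwargs)
except ImportError:
    lora_config = None  # the plain dict still documents the settings

print(f"LoRA scaling factor: {scaling}")
```

With alpha equal to the rank, the scaling factor is 1.0, so the adapter update is added to the frozen weights without extra amplification.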

Controlled run

Effective batch size 4, 5 epochs, learning rate 0.0002 (2e-4), short warmup and linear scheduler.
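
To make the schedule concrete, here is a sketch of a linear warmup followed by linear decay, the shape commonly used for a "linear" scheduler. The peak learning rate 2e-4 comes from the case study; the total of 50 steps is derived from 40 training examples, effective batch size 4 and 5 epochs, and the 5 warmup steps are a hypothetical value, since the text only says "short warmup".

```python
def linear_lr(step, total_steps, warmup_steps, base_lr=2e-4):
    """Learning rate at a given optimizer step: linear warmup, then linear decay to zero."""
    if step < warmup_steps:
        return base_lr * step / max(1, warmup_steps)
    remaining = total_steps - step
    return base_lr * max(0.0, remaining / max(1, total_steps - warmup_steps))


TOTAL_STEPS = 50   # derived: ceil(40 / 4) * 5, an assumption about the run
WARMUP_STEPS = 5   # hypothetical: the case study does not state the exact value

for step in (0, 5, 40, 50):
    print(step, linear_lr(step, TOTAL_STEPS, WARMUP_STEPS))
```

Under these assumptions the learning rate at step 40 is already well below its peak, which is one reason late checkpoints tend to change little.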

Best checkpoint

Best eval loss at step 40, with a critical reading of the final plateau and the generalization gap.
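
Selecting the best checkpoint comes down to taking the minimum over the logged evaluation losses rather than the last entry. The sketch below assumes a log shaped like a list of dictionaries with `step` and `eval_loss` keys, similar to what many training loops record; the function name is illustrative. In the example log only the step 40 value (1.500) comes from the case study, and every other number is a placeholder invented for illustration.

```python
def best_eval_step(log_history):
    """Return (step, eval_loss) for the entry with the lowest evaluation loss."""
    evals = [entry for entry in log_history if "eval_loss" in entry]
    if not evals:
        raise ValueError("no evaluation entries in the log")
    best = min(evals, key=lambda entry: entry["eval_loss"])
    return best["step"], best["eval_loss"]


# Placeholder log: only the step-40 value is taken from the case study.
example_log = [
    {"step": 10, "loss": 2.10},        # training-only entry, ignored
    {"step": 30, "eval_loss": 1.52},   # invented for illustration
    {"step": 40, "eval_loss": 1.500},  # reported minimum
    {"step": 50, "eval_loss": 1.51},   # invented for illustration
]

step, loss = best_eval_step(example_log)
print(f"best checkpoint: step {step} (eval loss {loss:.3f})")
```

Picking the checkpoint this way keeps the selection tied to evaluation loss, so a final step that sits on a plateau or slightly above the minimum is not exported by default.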

Exported adapter

Reusable PEFT output, intermediate checkpoints and technical reporting ready for review.

Domain specialization: Adaptation of a compact model to a specific regulatory perimeter.

Training loss reduction: 52.6% improvement across the documented run.

Non-trivial evaluation: Best checkpoint identified before the final step, with plateau interpretation.

Technical reuse: Exported LoRA adapter, lightweight and suitable for local experimentation scenarios.

A case study for companies that want controllable proprietary AI.

LLora positions ZenkeiX as a studio able to build and document applied LLM workflows: from dataset to training, technical evaluation and artifact reuse.
