Clear domain
AI Act, compliance and internal procedures for Italian SMEs, with outputs designed for operational support.
LLora documents a complete domain adaptation pipeline: curated JSONL dataset, QLoRA training, intermediate checkpoints, final LoRA adapter and metric interpretation.
Overview
LLora was built to specialize a compact model on regulatory and operational content related to the European AI Act. The goal was not a general purpose model, but a realistic domain adaptation workflow.
The value is traceability: dataset, configuration, metrics, checkpoints and final adapter are presented as parts of a verifiable ML system.
AI Act, compliance and internal procedures for Italian SMEs, with outputs designed for operational support.
Qwen2.5 1.5B base model in 4-bit, separate LoRA adapter and a workflow suitable for local reuse.
Challenge
The challenge was not only training a model. It was defining a clean information boundary, with coherent examples, consistent responses and a credible specialization goal.
Solution
The solution uses QLoRA on a 4-bit quantized model, rank 16, lightweight adapter and intermediate checkpoints. The best checkpoint is identified at step 40, avoiding the mistake of treating the final step as the best model.
Technical stack
The case study highlights ML design, data preparation, controlled training and metric interpretation.
Compact instruction tuned and quantized model selected for efficient experimentation and local reuse.
Instruction/output records with 40/10 split, responses close to 100 words on average and consistent structure.
Efficient fine tuning without fully retraining the base model, with LoRA alpha 16 and dropout 0.
Effective batch 4, 5 epochs, learning rate 0.0002, short warmup and linear scheduler.
Best eval loss at step 40, with critical reading of the final plateau and generalization gap.
Reusable PEFT output, intermediate checkpoints and technical reporting ready for review.
Impact
LLora shows the ability to design an applied LLM workflow: data, method, training, metrics and technical documentation. The result is a credible operational demo, not a generic claim about artificial intelligence.
CTA
LLora positions ZenkeiX as a studio able to build and document applied LLM workflows: from dataset to training, technical evaluation and artifact reuse.
Select the categories you want to enable. Technical cookies are always active.