Lithuanian Llama2 for Assistive Writing

The ABILITY Team
Feb 3, 2025
1 min read

lt-llama2 performance graph (loss vs training steps) — lt-llama2 performance graph - loss vs training steps (image source: https://arxiv.org/pdf/2408.12963)

In late 2024, researchers from neurotechnology have released the preprint of their research on a finetuned Llama2 for Lithuanian language.

In their paper, researchers propose and describe the first open large language models (LLMs) for the Lithuanian language. The authors provide an accompanying question/answer (Q/A) dataset and translations of popular LLM benchmarks, as well as a detailed description of the proposed LLMs and their training process. Additionally, they conduct an empirical evaluation comparing the perplexities of the proposed LLMs with those of other modern open LLMs and benchmark the proposed model against language understanding tasks to determine the importance of high-quality pretraining datasets in achieving efficient model performance on these benchmarks.

Authors have also released the code base for this Lithuanian LLM at Hugging Face.

Blog posts

Lithuanian Llama2 for Assistive Writing

Recent Posts

Comments

Quick Links

Contact us

Follow