
In late 2024, researchers from neurotechnology have released the preprint of their research on a finetuned Llama2 for Lithuanian language.
In their paper, researchers propose and describe the first open large language models (LLMs) for the Lithuanian language. The authors provide an accompanying question/answer (Q/A) dataset and translations of popular LLM benchmarks, as well as a detailed description of the proposed LLMs and their training process. Additionally, they conduct an empirical evaluation comparing the perplexities of the proposed LLMs with those of other modern open LLMs and benchmark the proposed model against language understanding tasks to determine the importance of high-quality pretraining datasets in achieving efficient model performance on these benchmarks.
Authors have also released the code base for this Lithuanian LLM at Hugging Face.
Comentarios