低リソース言語におけるテキストデトックス化のためのタトクサシステム：タタール語の事例

要旨

テキストデトックス（有害・攻撃的なコンテンツの自動検出と軽減）は、オンラインコミュニティの安全性を確保し、ユーザーを保護するために不可欠である。しかし、タタール語のような低リソース言語については、これまで研究がほとんど進んでいない。本論文では、タタール語におけるテキストデトックスのための新たな最先端システムであるTatoxaを提案する。比較実験の結果、提案手法は主要な品質指標において、既存のオープンソースおよびプロプライエタリな商用大規模言語モデル（LLM）を上回ることが示された。また、低リソース環境でのファインチューニングと評価を目的とした、タタール語のテキストデトックス用の新しいデータセットも導入する。さらに、言語間転移実験から、文化的に近いロシア語を含む他言語からの転移は、大規模なロシア語コーパスが利用可能な場合でも、タタール語ネイティブデータでの学習よりも著しく性能が低いことが明らかになった。

English

Text detoxification, the automated detection and mitigation of abusive and harmful content, is essential for ensuring the safety of online communities and protecting users. However, low resource languages such as Tatar have received little research attention. In this paper we present Tatoxa, a novel state-of-the-art system for text detoxification in the Tatar language. Comparative experiments show that the proposed approach outperforms existing open source and proprietary commercial LLMs on key quality metrics. We also introduce a new dataset for text detoxification in Tatar, designed for fine tuning and evaluation in low resource settings. Finally, cross lingual transfer experiments indicate that transfer from other languages, including the culturally close Russian, performs significantly worse than training on native Tatar data even when a large Russian corpus is available.