Detecting Abusive Comments in Mizo: A Machine Learning Approach for a Low-Resource Language
- Title
- Detecting Abusive Comments in Mizo: A Machine Learning Approach for a Low-Resource Language
- Creator
- Lalrinmawii, R.; Lalramhluna, Robert; Gunavathi, R.
- Description
- The detection of abusive language in online spaces is crucial for ensuring a safe digital environment, particularly for low-resource languages like Mizo. Mizo, a tonal Tibeto-Burman language spoken primarily in Mizoram, India, poses significant computational challenges due to its phonetic complexity and limited linguistic resources. This research presents a method based on machine learning for abusive comment detection in Mizo, addressing the lack of annotated datasets and specialized NLP tools. A structured pipeline involving data collection, preprocessing, feature engineering, and model evaluation was implemented. Our study compares the effectiveness of several conventional machine learning methods, such as Random Forest, Support Vector Machines (SVM), Logistic Regression, and XGBoost, against transformer-based models such as Multilingual BERT(mBERT) and MizBERT. According to experimental data, MizBERT achieves the highest accuracy and F1-score, outperforming all other models by a substantial margin. This work contributes to the development of computational tools for Mizo NLP, laying a foundation for automated moderation systems and fostering digital inclusivity for Mizo-Speaking communities. 2025 IEEE.
- Source
- Proceedings - IEEE International Conference on Advanced Video and Signal-Based Surveillance, AVSS;Issue;2025;
- Date
- 01-01-2025
- Publisher
- Institute of Electrical and Electronics Engineers Inc.
- Subject
- abusive comment detection; content moderation; digital safety; Logistic Regression; low-resource NLP; machine learning; MizBERT; Mizo language; multilingual BERT (mBERT); n-grams; Random Forest; Support Vector Machines (SVM); TF-IDF vectorization; tonal language processing; transformer models; XGBoost
- Coverage
- Lalrinmawii R., Christ University, Department of Data Science, India; Lalramhluna R., National Institute of Technology, Department of Computer Science & Engineering, India; Gunavathi R., Christ University, Department of Data Science, India
- Rights
- Restricted Access; Hardcopy may be available in the library
- Relation
- ISSN: 26436213;
- Format
- online
- Language
- English
- Type
- Conference paper
Collection
Citation
Lalrinmawii, R.; Lalramhluna, Robert; Gunavathi, R., “Detecting Abusive Comments in Mizo: A Machine Learning Approach for a Low-Resource Language,” CHRIST (Deemed To Be University) Institutional Repository, accessed June 18, 2026, https://archives.christuniversity.in/items/show/25769.
