Detecting Abusive Comments in Mizo: A Machine Learning Approach for a Low-Resource Language

Title: Detecting Abusive Comments in Mizo: A Machine Learning Approach for a Low-Resource Language
Creator: Lalrinmawii, R.; Lalramhluna, Robert; Gunavathi, R.
Description: The detection of abusive language in online spaces is crucial for ensuring a safe digital environment, particularly for low-resource languages like Mizo. Mizo, a tonal Tibeto-Burman language spoken primarily in Mizoram, India, poses significant computational challenges due to its phonetic complexity and limited linguistic resources. This research presents a method based on machine learning for abusive comment detection in Mizo, addressing the lack of annotated datasets and specialized NLP tools. A structured pipeline involving data collection, preprocessing, feature engineering, and model evaluation was implemented. Our study compares the effectiveness of several conventional machine learning methods, such as Random Forest, Support Vector Machines (SVM), Logistic Regression, and XGBoost, against transformer-based models such as Multilingual BERT(mBERT) and MizBERT. According to experimental data, MizBERT achieves the highest accuracy and F1-score, outperforming all other models by a substantial margin. This work contributes to the development of computational tools for Mizo NLP, laying a foundation for automated moderation systems and fostering digital inclusivity for Mizo-Speaking communities. 2025 IEEE.
Source: Proceedings - IEEE International Conference on Advanced Video and Signal-Based Surveillance, AVSS;Issue;2025;
Date: 01-01-2025
Publisher: Institute of Electrical and Electronics Engineers Inc.
Subject: abusive comment detection; content moderation; digital safety; Logistic Regression; low-resource NLP; machine learning; MizBERT; Mizo language; multilingual BERT (mBERT); n-grams; Random Forest; Support Vector Machines (SVM); TF-IDF vectorization; tonal language processing; transformer models; XGBoost
Coverage: Lalrinmawii R., Christ University, Department of Data Science, India; Lalramhluna R., National Institute of Technology, Department of Computer Science & Engineering, India; Gunavathi R., Christ University, Department of Data Science, India
Rights: Restricted Access; Hardcopy may be available in the library
Relation: ISSN: 26436213;
Format: online
Language: English
Type: Conference paper
Identifier: https://doi.org/10.1109/AVSS65446.2025.11149909

https://www.scopus.com/pages/publications/105017422795?origin=resultslist

Collection

Citation

Lalrinmawii, R.; Lalramhluna, Robert; Gunavathi, R., “Detecting Abusive Comments in Mizo: A Machine Learning Approach for a Low-Resource Language,” CHRIST (Deemed To Be University) Institutional Repository, accessed June 18, 2026, https://archives.christuniversity.in/items/show/25769.

Collection

Citation

Output Formats