• Andani Madodonga, Vukosi Marivate, and Matthew Adendorff. Izindaba-Tindzaba: Machine learning news categorisation for Long and Short Text for isiZulu and Siswati (To Appear), *Journal of the Digital Humanities Association of Southern Africa*, 2023. [NLP] <> [Paper URL] DOI: 10.55492/dhasa.v4i01.4449
  • Catherine Gitau and Vukosi Marivate. Textual Augmentation Techniques Applied to Low Resource Machine Translation: Case of Swahili (To Appear), *Journal of the Digital Humanities Association of Southern Africa*, 2023. [NLP] <> [Paper URL] DOI: 10.55492/dhasa.v4i01.4446


  • D.I. Adelani, M.M.I. Alam, A. Anastasopoulos, A. Bhagia, M. Costa-Jussá, J. Dodge, F. Faisal, C. Federmann, N. Fedorova, F. Guzmán, V. Marivate, and others. Findings of the WMT 2022 shared task on large-scale machine translation evaluation for African languages, Proceedings of the Seventh Conference on Machine Translation, Online. Association for Computational Linguistics. 2022. [NLP] <> [Paper URL]
  • Herkulaas Combrink, Vukosi Marivate, and Benjamin Rosman. Reinforcement Learning in Education: A Multi-Armed Bandit Approach, arXiv preprint arXiv:2211.00779, 2022. [ML][SOC] <> [Paper URL] [Preprint URL]
  • Herkulaas MvE Combrink, Vukosi Marivate, and Benjamin Rosman. Comparing Synthetic Tabular Data Generation Between a Probabilistic Model and a Deep Learning Model for Education Use Cases, Proceedings of SACAIR2022 Online Conference, the 3rd Southern African Conference for Artificial Intelligence Research. 2022. [ML][SOC] <> [Paper URL] [Preprint URL]
  • Nicolle Garber and Vukosi Marivate. Conversational Pattern Mining using Motif Detection, Pan-African Artificial Intelligence and Smart Systems (To Appear). 2022. [ML][NLP] <> [Paper URL] [Preprint URL]
  • David Adelani, Graham Neubig, Sebastian Ruder, Shruti Rijhwani, Dietrich Klakow, Michael Coenraad Beukman, Chester Palen-Michel, Constantine Lignos, Jesujoba Alabi, Shamsuddeen Hassan Muhammad, Peter Nabende, Cheikh M. Bamba Dione, Andiswa Bukula, Rooweither Mabuya, Bonaventure F. P. Dossou, Blessing Sibanda, Happy Buzaaba, Jonathan Mukiibi, Godson K. KALIPE, Derguene Mbaye, Amelia Taylor, Fatoumata Ouoba Kabore, Chris Chinenye Emezue, Anuoluwapo Aremu, Perez Ogayo, Catherine Gitau, Edwin Munkoh-Buabeng, Victoire Memdjokam Koagne, Allahsera Auguste Tapo, Tebogo Macucwa, Vukosi Marivate, MBONING TCHIAZE Elvis, Tajuddeen Gwadabe, Tosin Adewumi, Orevaoghene Ahia, Joyce Nakatumba-Nabende, Neo Lerato Mokono, Ignatius Ezeani, Chiamaka Chukwuneke, Mofetoluwa Oluwaseun Adeyemi, Gilles Quentin HACHEME, Idris Abdulmumin, Odunayo Jude Ogundepo, Oreen Yousuf, and Tatiana Moteu. AfroNER: Africa-centric Transfer Learning for Named Entity Recognition, Conference on Empirical Methods in Natural Language Processing (EMNLP). 2022. [ML][NLP] <> [Paper URL] [Preprint URL]
  • M. Ledwaba and V. Marivate. Semi-Supervised Learning Approaches for Predicting South African Political Sentiment for Local Government Elections, DG.O 2022: The 23rd Annual International Conference on Digital Government Research. 2022. [ML][NLP] <> [Paper URL] [Preprint URL] DOI: 10.1145/3543434.3543484
  • A. Modupe, T. Celik, V. Marivate, and O.O. Olugbara. Post-Authorship Attribution Using Regularized Deep Neural Network, Applied Sciences, 2022. [ML][NLP] <> [Paper URL] DOI: 10.3390/app12157518
  • R. Rockefeller, B. Bah, V. Marivate, and H.G. Zimmermann. Improving the Predictive Power of Historical Consistent Neural Networks, Engineering Proceedings, 2022. [ML] <> [Paper URL] DOI: 10.3390/engproc2022018036
  • S. Kabongo Kabenamualu, V. Marivate, and H. Kamper. LiSTra Automatic Speech Translation: English to Lingala Case Study, Proceedings of The Workshop on Dataset Creation for Lower-Resourced Languages within the 13th Language Resources and Evaluation Conference. 2022. [NLP] <> [Paper URL] [Preprint URL]
  • M. Mokoatle, D. Mapiye, V. Marivate, V.M. Hayes, and R. Bornman. Discriminatory Gleason grade group signatures of prostate cancer: An application of machine learning methods, PLOS ONE, 2022. [ML][NLP] <> [Paper URL] DOI: 10.1371/journal.pone.0267714
  • M. Makgatho, V. Marivate, T. Sefara, and V. Wagner. Training Cross-Lingual embeddings for Setswana and Sepedi, *Journal of the Digital Humanities Association of Southern Africa*, 2022. [NLP] <> [Paper URL] [Preprint URL] [Dataset] DOI: 10.55492/dhasa.v3i03.3822
  • T. Mokoena, T. Celik, and V. Marivate. Why is this an anomaly? Explaining anomalies using sequential explanations, Pattern Recognition, 2022. [ML] <> [Paper URL] [Preprint URL] DOI: 10.1016/j.patcog.2021.108227


  • K.D. Dhole, V. Gangal, S. Gehrmann, A. Gupta, Z. Li, S. Mahamood, A. Mahendiran, S. Mille, A. Srivastava, S. Tan, and others. NL-Augmenter: A Framework for Task-Sensitive Natural Language Augmentation, arXiv preprint arXiv:2112.02721, 2021. [NLP] <> [Preprint URL]
  • H. de Wet and V. Marivate. Is it Fake? News Disinformation Detection on South African News Websites, 2021 IEEE AFRICON. 2021. [NLP][SOC] <> [Paper URL] [Preprint URL] [Dataset] DOI: 10.1109/AFRICON51333.2021.9570905
  • D. Behr, C. wa Maina, and V. Marivate. An empirical investigation into audio pipeline approaches for classifying bird species, 2021 IEEE AFRICON. 2021. [ML][SOC] <> [Paper URL] [Preprint URL] DOI: 10.1109/AFRICON51333.2021.9570862
  • L. Nthimo, T. Mokoena, A. Modupe, and V. Marivate. Call Centre Shift Schedule Optimisation using Local Search Heuristics, 2021 IEEE AFRICON. 2021. [ML] <> [Paper URL] DOI: 10.1109/AFRICON51333.2021.9570947
  • V. Marivate, A. Moodley, and A. Saba. Extracting and categorising the reactions to COVID-19 by the South African public-A social media study, 2021 IEEE AFRICON. 2021. [NLP][SOC] <> [Paper URL] [Preprint URL] DOI: 10.1109/AFRICON51333.2021.9571010
  • M. Terblanche and V. Marivate. Towards Financial Sentiment Analysis in a South African Landscape, International Cross-Domain Conference for Machine Learning and Knowledge Extraction. 2021. [NLP][SOC] <> [Paper URL] [Preprint URL] [Dataset] DOI: 10.1007/978-3-030-84060-0_12
  • V. Marivate, P. Aghoghovwia, Y. Ismail, F. Mahomed-Asmail, and S.L. Steenhuisen. The Fourth Industrial Revolution-what does it mean to our future faculty?, South African Journal of Science, 2021. [SOC] <> [Paper URL] [Preprint URL] DOI: 10.17159/sajs.2021/10702
  • O. Oladeji, C. Zhang, T. Moradi, D. Tarapore, A.C. Stokes, V. Marivate, M.D. Sengeh, E.O. Nsoesie, and others. Monitoring Information-Seeking Patterns and Obesity Prevalence in Africa With Internet Search Data: Observational Study, JMIR Public Health and Surveillance, 2021. [NLP][SOC] <> [Paper URL] DOI: 10.2196/24348
  • T.J. Sefara, S.G. Zwane, N. Gama, H. Sibisi, P.N. Senoamadi, and V. Marivate. Transformer-based machine translation for low-resourced languages embedded with language identification, 2021 Conference on Information Communications Technology and Society (ICTAS). 2021. [NLP] <> [Paper URL] DOI: 10.1109/ICTAS50802.2021.9394996
  • M.U. Kraemer, S.V. Scarpino, V. Marivate, B. Gutierrez, B. Xu, G. Lee, J.B. Hawkins, C. Rivers, D.M. Pigott, R. Katz, and others. Data curation during a pandemic and lessons learned from COVID-19, Nature Computational Science, 2021. [SOC] <> [Paper URL] DOI: 10.1038/s43588-020-00015-6
  • C.H. Ngejane, J.H. Eloff, T.J. Sefara, and V.N. Marivate. Digital forensics supported by machine learning for the detection of online sexual predatory chats, Forensic Science International: Digital Investigation, 2021. [NLP] <> [Paper URL] [Preprint URL] DOI: 10.1016/j.fsidi.2021.301109
  • H. Combrink, V. Marivate, and B. Rosman. A Framework for Undergraduate Data Collection Strategies for Student Support Recommendation Systems in Higher Education, Southern African Conference for Artificial Intelligence Research. 2021. [ML][SOC] <> [Preprint URL]


  • W. Nekoto, V. Marivate, T. Matsila, T. Fasubaa, T. Fagbohungbe, S.O. Akinola, S. Muhammad, S.K. Kabenamualu, S. Osei, F. Sackey, and others. Participatory Research for Low-resourced Machine Translation: A Case Study in African Languages, Findings of the Association for Computational Linguistics: EMNLP 2020. 2020. [NLP][SOC] <> [Paper URL] [Preprint URL] [Dataset] DOI: 10.18653/v1/2020.findings-emnlp.195
  • N. Mtsweni, H.M. Combrink, and V. Marivate. Mapping the South African health landscape in response to COVID-19, arXiv preprint arXiv:2006.15216, 2020. [SOC] <> [Preprint URL] [Dataset]
  • V. Marivate and T. Sefara. Improving short text classification through global augmentation methods, International Cross-Domain Conference for Machine Learning and Knowledge Extraction. 2020. [NLP] <> [Paper URL] [Preprint URL] DOI: 10.1007/978-3-030-57321-8_21
  • K. Naidoo and V. Marivate. Unsupervised anomaly detection of healthcare providers using generative adversarial networks, Conference on e-Business, e-Services and e-Society. 2020. [ML] <> [Paper URL] [Preprint URL] DOI: 10.1007/978-3-030-44999-5_35
  • V. Marivate and H.M. Combrink. Use of Available Data To Inform The COVID-19 Outbreak in South Africa: A Case Study, Data Science Journal, 2020. [SOC] <> [Paper URL] [Preprint URL] [Dataset] DOI: 10.5334/dsj-2020-019
  • V. Marivate, T. Sefara, V. Chabalala, K. Makhaya, T. Mokgonyane, R. Mokoena, and A. Modupe. Investigating an Approach for Low Resource Language Dataset Creation, Curation and Classification: Setswana and Sepedi, Proceedings of the first workshop on Resources for African Indigenous Languages. 2020. [NLP] <> [Paper URL] [Preprint URL] [Dataset]
  • I. Orife, J. Kreutzer, B. Sibanda, D. Whitenack, K. Siminyu, L. Martinus, J.T. Ali, J. Abbott, V. Marivate, S. Kabongo, and others. Masakhane–Machine Translation For Africa, arXiv preprint arXiv:2003.11529, 2020. [NLP][SOC] <> [Preprint URL]
  • V. Marivate. Why African natural language processing now? A view from South Africa #AfricaNLP, 2020. [ML][NLP] <> [Paper URL]
  • Henry Wandera, Vukosi Marivate, and David Sengeh. Investigating similarities and differences between South African and Sierra Leonean school outcomes using Machine Learning, CoRR, 2020. [ML][SOC] <> [Paper URL]


  • H. Wandera, V. Marivate, and M.D. Sengeh. Predicting National School Performance for Policy Making in South Africa, 2019 6th International Conference on Soft Computing & Machine Intelligence (ISCMI). 2019. [ML][SOC] <> [Paper URL]
  • A. Moodley and V. Marivate. Topic modelling of news articles for two consecutive elections in South Africa, 2019 6th International Conference on Soft Computing & Machine Intelligence (ISCMI). 2019. [NLP][SOC] <> [Paper URL]
  • M. Mokoatle, Vukosi Marivate, and Michael Esiefarienrhe Bukohwo. Predicting road traffic accident severity using accident report data in South Africa, Proceedings of the 20th annual international conference on digital government research. 2019. [ML][SOC] <> [Paper URL]


  • V. Marivate and N. Moorosi. Exploring data science for public good in South Africa: evaluating factors that lead to success, Proceedings of the 19th Annual International Conference on Digital Government Research: Governance in the Data Age. 2018. [ML][SOC] <> [Paper URL]
  • M. Mokoatle and V. Marivate. Collision Course: Challenges with Road Traffic Accident Data in South Africa, 2018 International Conference on Advances in Big Data, Computing and Data Communication Systems (icABCD). 2018. [ML][SOC] <> [Paper URL]


  • T. Mokoena, O. Lebogo, A. Dlaba, and V. Marivate. Bringing sequential feature explanations to life, 2017 IEEE AFRICON. 2017. [ML][NLP] <> [Paper URL]
  • V. Marivate and N. Moorosi. Employment relations: a data driven analysis of job markets using online job boards and online professional networks, Proceedings of the International Conference on Web Intelligence. 2017. [NLP][SOC] <> [Paper URL]
  • A. Modupe, T. Celik, V. Marivate, and M. Diale. Semi-supervised probabilistics approach for normalising informal short text messages, 2017 Conference on Information Communication Technology and Society (ICTAS). 2017. [NLP] <> [Paper URL]
  • N. Moorosi, M. Thinyane, and V. Marivate. A Critical and Systemic Consideration of Data for Sustainable Development in Africa, International Conference on Social Implications of Computers in Developing Countries. 2017. [SOC] <> [Paper URL]


  • P.M. Monamo, V. Marivate, and B. Twala. A multifaceted approach to Bitcoin fraud detection: Global and local outliers, 2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA). 2016. [ML] <> [Paper URL]
  • V.N. Marivate and P. Moiloa. Catching crime: Detection of public safety incidents using social media, 2016 Pattern Recognition Association of South Africa and Robotics and Mechatronics International Conference (PRASA-RobMech). 2016. [NLP][SOC] <> [Paper URL] DOI: 10.1109/RoboMech.2016.7813140
  • P. Monamo, V. Marivate, and B. Twala. Unsupervised learning for robust Bitcoin fraud detection, 2016 Information Security for South Africa (ISSA). 2016. [ML][SOC] <> [Paper URL] DOI: 10.1109/ISSA.2016.7802939


  • V.N. Marivate. Extracting South African safety and security incident patterns from social media, 2015 Pattern Recognition Association of South Africa and Robotics and Mechatronics International Conference (PRASA-RobMech). 2015. [NLP][SOC] <> [Paper URL] DOI: 10.1109/RoboMech.2015.7359507
  • N. Moorosi and V. Marivate. Privacy in mining crime data from social Media: A South African perspective, 2015 Second International Conference on Information Security and Cyber Forensics (InfoSec). 2015. [ML][SOC] <> [Paper URL] DOI: 10.1109/InfoSec.2015.7435524