Nathan Brown and Vukosi Marivate. BOTS-LM: Training Large Language Models for Setswana, arXiv preprint arXiv: 2408.02239, 2024. [NLP] <> [Preprint URL]
Atnafu Lambebo Tonja, Bonaventure F. P. Dossou, Jessica Ojo, Jenalea Rajab, Fadel Thior, Eric Peter Wairagala, Aremu Anuoluwapo, Pelonomi Moiloa, Jade Abbott, Vukosi Marivate, and Benjamin Rosman. InkubaLM: A small language model for low-resource African languages, arXiv preprint arXiv: 2408.17024, 2024. [NLP] <> [Preprint URL]
Idris Abdulmumin, Sthembiso Mkhwanazi, Mahlatse S. Mbooi, Shamsuddeen Hassan Muhammad, Ibrahim Said Ahmad, Neo Putini, Miehleketo Mathebula, Matimba Shingange, Tajuddeen Gwadabe, and Vukosi Marivate. Correcting FLORES Evaluation Dataset for Four African Languages, arXiv preprint arXiv: 2409.00626, 2024. [NLP] <> [Preprint URL]
M. Malange, S. Rananga, M.S. Mbooi, B. Isong, and V. Marivate. Investigating the Effectiveness of Detecting Misinformation on Social Media using Tshivenda Language, 2024 IST-Africa Conference (IST-Africa). 2024. [NLP][SOC] <> [Paper URL] DOI:10.23919/IST-Africa63983.2024.10569873
D. WALKER, S. RANANGA, B. ISONG, and V. MARIVATE. Generalising Across Domains in Video Misinformation Detection, 2024 IST-Africa Conference (IST-Africa). 2024. [NLP][SOC] <> [Paper URL] DOI:10.23919/IST-Africa63983.2024.10569650
M. Mukwevho, S. Rananga, M.S. Mbooi, B. Isong, and V. Marivate. Building a Dataset for Misinformation Detection in the Low-Resource Language, 2024 IST-Africa Conference (IST-Africa). 2024. [NLP][SOC] <> [Paper URL] DOI:10.23919/IST-Africa63983.2024.10569562
Michelle Terblanche, Kayode Olaleye, and Vukosi Marivate. Prompting towards Alleviating Code-Switched Data Scarcity in Under-Resourced Languages with GPT as a Pivot, Proceedings of the 3rd Annual Meeting of the Special Interest Group on Under-resourced Languages @ LREC-COLING 2024. 2024. [NLP][SOC] <> [Paper URL] [Preprint URL]
2023
T.A. Sindane, V. Marivate, and M. Terblanche. Point of pivot: cross-lingual embeddings calibration for Southern Nguni and Niger-Congo low-resourced languages., Deep Learning Indaba 2023. 2023. [NLP] <> [Preprint URL]
Rozina Myoya, Lesego Matojane, Abiodun Modupe, Vukosi Marivate, and Albert Myburgh. Evaluating Drone Imagery for Wildlife Unique Feature Identification, Proceedings of the Fourth Southern African Conference for Artificial Intelligence Research. 2023. [ML][SOC] <> [Paper URL]
A. De Jager, V. Marivate, and A. Modupe. Multimodal Misinformation Detection in a South African Social Media Environment, Artificial Intelligence Research. SACAIR 2023. Communications in Computer and Information Science. 2023. [NLP][SOC] <> [Paper URL] [Preprint URL] DOI:10.1007/978-3-031-49002-6_19
Rockefeller, Bubacarr Bah, Hans-Georg Zimmermann, and Vukosi Marivate. Wind Power Prediction with HCNNs for Turbines, 2023 International Conference on Electrical, Computer and Energy Technologies (ICECET). 2023. [ML][ SOC] <> [Paper URL] DOI:10.1109/ICECET58911.2023.10389320
Vukosi Marivate, Moseli Mots’Oehli, Valencia Wagner, Richard Lastrucci, and Isheanesu Dzingirai. PuoBERTa: Training and evaluation of a curated language model for Setswana, Artificial Intelligence Research. SACAIR 2023. Communications in Computer and Information Science. 2023. [NLP] <> [Paper URL] [Preprint URL] [Dataset] [Software/Library] DOI:10.1007/978-3-031-49002-6_17
Baphumelele Masikisiki, Vukosi Marivate, and Yvette Hlope. Investigating the Efficacy of Large Language Models in Reflective Assessment Methods through Chain of Thoughts Prompting, 4th African Human Computer Interaction Conference (AfriCHI 2023). 2023. [NLP][ SOC] <> [Preprint URL] [Dataset] DOI:10.1145/3628096.3628747
T. Kekere, V. Marivate, and M. Hattingh. Exploring COVID-19 public perceptions in South Africa through sentiment analysis and topic modelling of Twitter posts, The African Journal of Information and Communication (AJIC), 2023. [NLP][ SOC] <> [Paper URL] DOI:10.23962/ajic.i31.14834
Abiodun Modupe, Turgay Celik, Vukosi Marivate, and Oludayo O. Olugbara. Integrating Bidirectional Long Short-Term Memory with Subword Embedding for Authorship Attribution, 2023 IEEE International Conference on Systems, Man, and Cybernetics (SMC). 2023. [NLP] <> [Preprint URL]
D. Ngomane, R. Mabuya, J. Abbott, and V. Marivate. Unsupervised Cross-lingual Word Embedding Representation for English-isiZulu, Proceedings of the Fourth workshop on Resources for African Indigenous Languages (RAIL 2023). 2023. [NLP] <> [Paper URL] [Dataset] [Video]
Trishanta Srikissoon and Vukosi Marivate. Combating Hate: How Multilingual Transformers Can Help Detect Topical Hate Speech, Proceedings of Society 5.0 Conference 2023. 2023. [NLP] <> [Paper URL] DOI:10.29007/1cm6
Marc Gagiano and Vukosi Marivate. Emotionally driven fake news in South Africa, Proceedings of Society 5.0 Conference 2023. 2023. [NLP] <> [Paper URL] DOI:10.29007/f35v
Cheikh M. Bamba Dione, David Adelani, Peter Nabende, Jesujoba Alabi, Thapelo Sindane, Happy Buzaaba, Shamsuddeen Hassan Muhammad, Chris Chinenye Emezue, Perez Ogayo, Anuoluwapo Aremu, Catherine Gitau, Derguene Mbaye, Jonathan Mukiibi, Blessing Sibanda, Bonaventure F. P. Dossou, Andiswa Bukula, Rooweither Mabuya, Allahsera Auguste Tapo, Edwin Munkoh-Buabeng, victoire Memdjokam Koagne, Fatoumata Ouoba Kabore, Amelia Taylor, Godson Kalipe, Tebogo Macucwa, Vukosi Marivate, Tajuddeen Gwadabe, Mboning Tchiaze Elvis, Ikechukwu Onyenwe, Gratien Atindogbe, Tolulope Adelani, Idris Akinade, Olanrewaju Samuel, Marien Nahimana, Théogène Musabeyezu, Emile Niyomutabazi, Ester Chimhenga, Kudzai Gotosa, Patrick Mizha, Apelete Agbolo, Seydou Traore, Chinedu Uchechukwu, Aliyu Yusuf, Muhammad Abdullahi, and Dietrich Klakow. MasakhaPOS: Part-of-Speech Tagging for Typologically Diverse African Languages, Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics. 2023. [NLP] <> [Paper URL] [Preprint URL] [Dataset] DOI:10.18653/v1/2023.acl-long.609
H.M. Combrink, V. Marivate, and B. Masikisiki. Technology-Enhanced Learning, Data Sharing, and Machine Learning Challenges in South African Education, Education Sciences, 2023. [SOC] <> [Paper URL] DOI:10.3390/educsci13050438
K.D. Dhole, V. Gangal, S. Gehrmann, A. Gupta, Z. Li, S. Mahamood, A. Mahendiran, S. Mille, A. Srivastava, S. Tan, and others. NL-Augmenter: A Framework for Task-Sensitive Natural Language Augmentation, Northern European Journal of Language Technology (NEJLT), 2023. [NLP] <> [Paper URL] [Preprint URL] [Software/Library]
Rendani Mbuvha, David I. Adelani, Tendani Mutavhatsindi, Tshimangadzo Rakhuhu, Aluwani Mauda, Tshifhiwa Joshua Maumela, Andisani Masindi, Seani Rananga, Vukosi Marivate, and Tshilidzi Marwala. MphayaNER: Named Entity Recognition for Tshivenda, arXiv preprint arXiv: Arxiv-2304.03952, 2023. [NLP] <> [Paper URL]
R. Lastrucci, J. Rajab, M. Shingange, D. Njini, and V. Marivate. Preparing the Vuk’uzenzele and ZA-gov-multilingual South African multilingual corpora, Proceedings of the Fourth workshop on Resources for African Indigenous Languages (RAIL 2023). 2023. [NLP] <> [Paper URL] [Preprint URL] [Dataset] [Video]
M. Mokoatle, V. Marivate, D. Mapiye, R. Bornman, V. Hayes, and others. A review and comparative study of cancer detection using machine learning: SBERT and SimCSE application, BMC bioinformatics, 2023. [NLP][SOC] <> [Paper URL] DOI:10.1186/s12859-023-05235-x
Andani Madodonga, Vukosi Marivate, and Matthew Adendorff. Izindaba-Tindzaba: Machine learning news categorisation for Long and Short Text for isiZulu and Siswati, *Journal of the Digital Humanities Association of Southern Africa *, 2023. [NLP] <> [Paper URL] [Preprint URL] [Dataset] DOI:10.55492/dhasa.v4i01.4449
Catherine Gitau and Vukosi Marivate. Textual Augmentation Techniques Applied to Low Resource Machine Translation: Case of SwahilI, *Journal of the Digital Humanities Association of Southern Africa *, 2023. [NLP] <> [Paper URL] [Preprint URL] DOI:10.55492/dhasa.v4i01.4446
N. Garber and V. Marivate. Conversational Pattern Mining using Motif Detection, Pan-African Artificial Intelligence and Smart Systems: Second EAI International Conference, PAAISS 2022, Dakar, Senegal, November 2-4, 2022, Proceedings. 2023. [ML][NLP] <> [Paper URL] [Preprint URL] DOI:10.1007/978-3-031-25271-6_22
H.M. Combrink, V. Marivate, and B. Rosman. Reinforcement Learning in Education: A Multi-armed Bandit Approach, Emerging Technologies for Developing Countries. 2023. [ML][SOC] <> [Paper URL] [Preprint URL] DOI:10.1007/978-3-031-35883-8_1
2022
D.I. Adelani, M.M.I. Alam, A. Anastasopoulos, A. Bhagia, M. Costa-Jussá, J. Dodge, F. Faisal, C. Federmann, N. Fedorova, F. Guzmán, V. Marivate, and others. Findings of the WMT 2022 shared task on large-scale machine translation evaluation for african languages, Proceedings of the Seventh Conference on Machine Translation, Online. Association for Computational Linguistics. 2022. [NLP] <> [Paper URL]
Herkulaas MvE Combrink, Vukosi Marivate, and Benjamin Rosman. Comparing Synthetic Tabular Data Generation Between a Probabilistic Model and a Deep Learning Model for Education Use Cases, Proceedings of SACAIR2022 Online Conference, the 3rd Southern African Conference for Artificial Intelligence Research. 2022. [ML][SOC] <> [Paper URL] [Preprint URL] DOI:
D. Adelani, G. Neubig, S. Ruder, S. Rijhwani, M. Beukman, C. Palen-Michel, C. Lignos, J. Alabi, S. Muhammad, P. Nabende, C.M.B. Dione, A. Bukula, R. Mabuya, B.F.P. Dossou, B. Sibanda, H. Buzaaba, J. Mukiibi, G. Kalipe, D. Mbaye, A. Taylor, F. Kabore, C.C. Emezue, A. Aremu, P. Ogayo, C. Gitau, E. Munkoh-Buabeng, V. Memdjokam Koagne, A.A. Tapo, T. Macucwa, V. Marivate, M.T. Elvis, T. Gwadabe, T. Adewumi, O. Ahia, J. Nakatumba-Nabende, N.L. Mokono, I. Ezeani, C. Chukwuneke, M. Oluwaseun Adeyemi, G.Q. Hacheme, I. Abdulmumin, O. Ogundepo, O. Yousuf, T. Moteu, and D. Klakow. MasakhaNER 2.0: Africa-centric Transfer Learning for Named Entity Recognition, Conference on Empirical Methods in Natural Language Processing (EMNLP). 2022. [ML][NLP] <> [Paper URL] [Preprint URL] DOI:
M. Ledwaba and V. Marivate. Semi-Supervised Learning Approaches for Predicting South African Political Sentiment for Local Government Elections, DG.O 2022: The 23rd Annual International Conference on Digital Government Research. 2022. [ML][NLP] <> [Paper URL] [Preprint URL] DOI:10.1145/3543434.3543484
A. Modupe, T. Celik, V. Marivate, and O.O. Olugbara. Post-Authorship Attribution Using Regularized Deep Neural Network, Applied Sciences, 2022. [ML][NLP] <> [Paper URL] DOI:10.3390/app12157518
R. Rockefeller, B. Bah, V. Marivate, and H.G. Zimmermann. Improving the Predictive Power of Historical Consistent Neural Networks, Engineering Proceedings, 2022. [ML] <> [Paper URL] DOI:10.3390/engproc2022018036
S. Kabongo Kabenamualu, V. Marivate, and H. Kamper. LiSTra Automatic Speech Translation: English to Lingala Case Study, Proceedings of The Workshop on Dataset Creation for Lower-Resourced Languages within the 13th Language Resources and Evaluation Conference. 2022. [NLP] <> [Paper URL] [Preprint URL]
M. Mokoatle, D. Mapiye, V. Marivate, V.M. Hayes, and R. Bornman. Discriminatory Gleason grade group signatures of prostate cancer: An application of machine learning methods, PLOS ONE, 2022. [ML][NLP] <> [Paper URL] DOI:10.1371/journal.pone.0267714
M. Makgatho, V. Marivate, T. Sefara, and V. Wagner. Training Cross-Lingual embeddings for Setswana and Sepedi, *Journal of the Digital Humanities Association of Southern Africa *, 2022. [NLP] <> [Paper URL] [Preprint URL] [Dataset] DOI:10.55492/dhasa.v3i03.3822
T. Mokoena, T. Celik, and V. Marivate. Why is this an anomaly? Explaining anomalies using sequential explanations, Pattern Recognition, 2022. [ML] <> [Paper URL] [Preprint URL] DOI:10.1016/j.patcog.2021.108227
D. Behr, C. wa Maina, and V. Marivate. An empirical investigation into audio pipeline approaches for classifying bird species, 2021 IEEE AFRICON. 2021. [ML][SOC] <> [Paper URL] [Preprint URL] DOI:10.1109/AFRICON51333.2021.9570862
L. Nthimo, T. Mokoena, A. Modupe, and V. Marivate. Call Centre Shift Schedule Optimisation using Local Search Heuristics, 2021 IEEE AFRICON. 2021. [ML] <> [Paper URL] DOI:10.1109/AFRICON51333.2021.9570947
V. Marivate, A. Moodley, and A. Saba. Extracting and categorising the reactions to COVID-19 by the South African public-A social media study, 2021 IEEE AFRICON. 2021. [NLP][SOC] <> [Paper URL] [Preprint URL] DOI:10.1109/AFRICON51333.2021.9571010
M. Terblanche and V. Marivate. Towards Financial Sentiment Analysis in a South African Landscape, International Cross-Domain Conference for Machine Learning and Knowledge Extraction. 2021. [NLP][SOC] <> [Paper URL] [Preprint URL] [Dataset] DOI:10.1007/978-3-030-84060-0_12
V. Marivate, P. Aghoghovwia, Y. Ismail, F. Mahomed-Asmail, and S.L. Steenhuisen. The Fourth Industrial Revolution-what does it mean to our future faculty?, South African Journal of Science, 2021. [SOC] <> [Paper URL] [Preprint URL] DOI:10.17159/sajs.2021/10702
O. Oladeji, C. Zhang, T. Moradi, D. Tarapore, A.C. Stokes, V. Marivate, M.D. Sengeh, E.O. Nsoesie, and others. Monitoring Information-Seeking Patterns and Obesity Prevalence in Africa With Internet Search Data: Observational Study, JMIR public health and surveillance, 2021. [NLP][SOC] <> [Paper URL] DOI:10.2196/24348
T.J. Sefara, S.G. Zwane, N. Gama, H. Sibisi, P.N. Senoamadi, and V. Marivate. Transformer-based machine translation for low-resourced languages embedded with language identification, 2021 Conference on Information Communications Technology and Society (ICTAS). 2021. [NLP] <> [Paper URL] DOI:10.1109/ICTAS50802.2021.9394996
M.U. Kraemer, S.V. Scarpino, V. Marivate, B. Gutierrez, B. Xu, G. Lee, J.B. Hawkins, C. Rivers, D.M. Pigott, R. Katz, and others. Data curation during a pandemic and lessons learned from COVID-19, Nature Computational Science, 2021. [SOC] <> [Paper URL] DOI:10.1038/s43588-020-00015-6
C.H. Ngejane, J.H. Eloff, T.J. Sefara, and V.N. Marivate. Digital forensics supported by machine learning for the detection of online sexual predatory chats, Forensic science international: Digital investigation, 2021. [NLP] <> [Paper URL] [Preprint URL] DOI:10.1016/j.fsidi.2021.301109
H. Combrink, V. Marivate, and B. Rosman. A Framework for Undergraduate Data Collection Strategies for Student Support Recommendation Systems in Higher Education, Southern African Conference for Artificial Intelligence Research. 2021. [ML][SOC] <> [Preprint URL]
2020
W. Nekoto, V. Marivate, T. Matsila, T. Fasubaa, T. Fagbohungbe, S.O. Akinola, S. Muhammad, S.K. Kabenamualu, S. Osei, F. Sackey, and others. Participatory Research for Low-resourced Machine Translation: A Case Study in African Languages, Findings of the Association for Computational Linguistics: EMNLP 2020. 2020. [NLP][SOC] <> [Paper URL] [Preprint URL] [Dataset] DOI:10.18653/v1/2020.findings-emnlp.195
N. Mtsweni, H.M. Combrink, and V. Marivate. Mapping the South African health landscape in response to COVID-19, arXiv preprint arXiv:2006.15216, 2020. [SOC] <> [Preprint URL] [Dataset]
V. Marivate and T. Sefara. Improving short text classification through global augmentation methods, International Cross-Domain Conference for Machine Learning and Knowledge Extraction. 2020. [NLP] <> [Paper URL] [Preprint URL] [Software/Library] DOI:10.1007/978-3-030-57321-8_21
K. Naidoo and V. Marivate. Unsupervised anomaly detection of healthcare providers using generative adversarial networks, Conference on e-Business, e-Services and e-Society. 2020. [ML] <> [Paper URL] [Preprint URL] DOI:10.1007/978-3-030-44999-5_35
V. Marivate and H.M. Combrink. Use of Available Data To Inform The COVID-19 Outbreak in South Africa: A Case Study, Data Science Journal, 2020. [SOC] <> [Paper URL] [Preprint URL] [Dataset] DOI:10.5334/dsj-2020-019
V. Marivate, T. Sefara, V. Chabalala, K. Makhaya, T. Mokgonyane, R. Mokoena, and A. Modupe. Investigating an Approach for Low Resource Language Dataset Creation, Curation and Classification: Setswana and Sepedi, Proceedings of the first workshop on Resources for African Indigenous Languages. 2020. [NLP] <> [Paper URL] [Preprint URL] [Dataset]
I. Orife, J. Kreutzer, B. Sibanda, D. Whitenack, K. Siminyu, L. Martinus, J.T. Ali, J. Abbott, V. Marivate, S. Kabongo, and others. Masakhane–Machine Translation For Africa, arXiv preprint arXiv:2003.11529, 2020. [NLP][SOC] <> [Preprint URL]
V. Marivate. Why African natural language processing now? A view from South Africa# AfricaNLP, **, 2020. [ML][NLP] <> [Paper URL]
Henry Wandera, Vukosi Marivate, and David Sengeh. Investigating similarities and differences between South African and
Sierra Leonean school outcomes using Machine Learning, CoRR, 2020. [ML][SOC] <> [Paper URL]
2019
H. Wandera, V. Marivate, and M.D. Sengeh. Predicting National School Performance for Policy Making in South Africa, 2019 6th International Conference on Soft Computing & Machine Intelligence (ISCMI). 2019. [ML][SOC] <> [Paper URL]
A. Moodley and V. Marivate. Topic modelling of news articles for two consecutive elections in South Africa, 2019 6th International Conference on Soft Computing & Machine Intelligence (ISCMI). 2019. [NLP][SOC] <> [Paper URL]
M. Mokoatle, D. Vukosi Marivate, and P. Michael Esiefarienrhe Bukohwo. Predicting road traffic accident severity using accident report data in South Africa, Proceedings of the 20th annual international conference on digital government research. 2019. [ML][SOC] <> [Paper URL]
2018
V. Marivate and N. Moorosi. Exploring data science for public good in South Africa: evaluating factors that lead to success, Proceedings of the 19th Annual International Conference on Digital Government Research: Governance in the Data Age. 2018. [ML][SOC] <> [Paper URL]
M. Mokoatle and V. Marivate. Collision Course: Challenges with Road Traffic Accident Data in South Africa, 2018 International Conference on Advances in Big Data, Computing and Data Communication Systems (icABCD). 2018. [ML][SOC] <> [Paper URL]
2017
T. Mokoena, O. Lebogo, A. Dlaba, and V. Marivate. Bringing sequential feature explanations to life, 2017 IEEE AFRICON. 2017. [ML][NLP] <> [Paper URL]
V. Marivate and N. Moorosi. Employment relations: a data driven analysis of job markets using online job boards and online professional networks, Proceedings of the International Conference on Web Intelligence. 2017. [NLP][SOC] <> [Paper URL]
A. Modupe, T. Celik, V. Marivate, and M. Diale. Semi-supervised probabilistics approach for normalising informal short text messages, 2017 Conference on Information Communication Technology and Society (ICTAS). 2017. [NLP] <> [Paper URL]
N. Moorosi, M. Thinyane, and V. Marivate. A Critical and Systemic Consideration of Data for Sustainable Development in Africa, International Conference on Social Implications of Computers in Developing Countries. 2017. [SOC] <> [Paper URL]
2016
P.M. Monamo, V. Marivate, and B. Twala. A multifaceted approach to Bitcoin fraud detection: Global and local outliers, 2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA). 2016. [ML] <> [Paper URL]
V.N. Marivate and P. Moiloa. Catching crime: Detection of public safety incidents using social media, 2016 Pattern Recognition Association of South Africa and Robotics and Mechatronics International Conference (PRASA-RobMech). 2016. [NLP][SOC] <> [Paper URL] DOI:10.1109/RoboMech.2016.7813140
P. Monamo, V. Marivate, and B. Twala. Unsupervised learning for robust Bitcoin fraud detection, 2016 Information Security for South Africa (ISSA). 2016. [ML][SOC] <> [Paper URL] DOI:10.1109/ISSA.2016.7802939
2015
V.N. Marivate. Extracting South African safety and security incident patterns from social media, 2015 Pattern Recognition Association of South Africa and Robotics and Mechatronics International Conference (PRASA-RobMech). 2015. [NLP][SOC] <> [Paper URL] DOI:10.1109/RoboMech.2015.7359507
N. Moorosi and V. Marivate. Privacy in mining crime data from social Media: A South African perspective, 2015 Second International Conference on Information Security and Cyber Forensics (InfoSec). 2015. [ML][SOC] <> [Paper URL] DOI:10.1109/InfoSec.2015.7435524