Mahesh Kumar

Mahesh Kumar

AI/ML Data Annotator & Curator | Enterprise IT Professional

Built the world's largest public labeled Hinglish dataset (125K+ instances) at IIT Gandhinagar. Passionate about multilingual NLP and reliable enterprise systems.

About Me

Former Technical Assistant with 2+ years of hands-on experience at IT Gandhinagar • LINGO Research Group and Bharat Electronics Limited (BEL). I specialize in multilingual NLP data curation, large-scale dataset annotation, and enterprise-grade IT infrastructure.


During my tenure at the LINGO Research Group, I worked under the guidance of Prof. Mayank Singh and closely collaborated with Rajvee Sheth. My work involved curating and annotating Hindi-English code-mixed datasets and supporting the development of large-scale benchmarks for multilingual NLP and LLM training.

Experience

Jun 2024 – Feb 2026

  • • Collaborated on the world's largest public Hinglish dataset for LLM training
  • • High-precision annotation using COMMENTATOR tool
  • • Created benchmarks for code-mixed Hindi-English NLP

Mar 2022 – Mar 2023

  • • Deployed enterprise VDI & data backup systems
  • • Managed hardware troubleshooting & network peripherals
  • • Achieved 99.9% system uptime

Projects

LINGO Research Group
COMI-LINGUA Dataset & COMMENTATOR Tool

Contributed to the world's largest public labeled Hindi-English code-mixed dataset. Performed granular linguistic annotation and benchmark creation for LLMs.

Hinglish NLP Data Annotation COMMENTATOR
View Project →
Personal
Secure Android Login System

Designed and implemented secure user authentication interfaces using XML layouts and Java in Android Studio with proper validation and encryption.

Android Studio Java + XML UI/UX Design

Education

Diploma in CS & Engineering

New Government Polytechnic Patna-13

79.11%

Class 12 (PCM)

Hellens School Rajopatti, Sitamarhi

56.6%

Class 10

Doon Senior Secondary School, Muzaffarpur

8.8 CGPA

Certifications

Cisco IT Essential

Verified

Comprehensive training in computer hardware and software components.

Cisco Networking Academy (2019)

Certified Android Apps Developer

Verified

Professional certification in mobile application design and deployment.

NIELIT, Patna (2021)

Web Technologies

Verified

Focused on full-stack web development and responsive design principles.

Arbazo Infotech Pvt. Ltd. (2019)

Kushal Yuva Program

Verified

Skill development program focusing on communication and IT literacy.

BSDM (2019)

Get In Touch

Whether you want to collaborate on a project, have a question, or just want to say hi, feel free to reach out!