AI/ML Data Annotator & Curator | Enterprise IT Professional
Built the world's largest public labeled Hinglish dataset (125K+ instances) at IIT Gandhinagar. Passionate about multilingual NLP and reliable enterprise systems.
Former Technical Assistant with 2+ years of hands-on experience at IT Gandhinagar • LINGO Research Group and Bharat Electronics Limited (BEL). I specialize in multilingual NLP data curation, large-scale dataset annotation, and enterprise-grade IT infrastructure.
During my tenure at the LINGO Research Group, I worked under the guidance of Prof. Mayank Singh and closely collaborated with Rajvee Sheth. My work involved curating and annotating Hindi-English code-mixed datasets and supporting the development of large-scale benchmarks for multilingual NLP and LLM training.
Jun 2024 – Feb 2026
Mar 2022 – Mar 2023
Contributed to the world's largest public labeled Hindi-English code-mixed dataset. Performed granular linguistic annotation and benchmark creation for LLMs.
Designed and implemented secure user authentication interfaces using XML layouts and Java in Android Studio with proper validation and encryption.
Comprehensive training in computer hardware and software components.
Cisco Networking Academy (2019)Professional certification in mobile application design and deployment.
NIELIT, Patna (2021)Focused on full-stack web development and responsive design principles.
Arbazo Infotech Pvt. Ltd. (2019)Skill development program focusing on communication and IT literacy.
BSDM (2019)Whether you want to collaborate on a project, have a question, or just want to say hi, feel free to reach out!