Portfolio

Arham V. Doshi

ML Research, NLP, and Software Engineering. I build systems at the intersection of language, AI, and real-world impact.

Computer Science and Linguistics student at the University of Texas at Austin, focused on LLM evaluation, multimodal reasoning, and applied machine learning for healthcare and education.

Experience

Austin, Texas | October 2025 - Present

Undergraduate Research Assistant

University of Texas at Austin - Discourse Lab

  • Developing psycholinguistically-grounded evaluation system for verbal uncertainty in LLM-generated medical evidence synthesis
  • Built automated pipeline processing 150+ GPT-4o responses across 3 experimental conditions using Azure OpenAI APIs
  • Designed JSON-structured uncertainty detection framework integrating 50 years of meta-analytic research on probability expressions
  • Collaborating with Kaijie Mo to prepare findings for publication on uncertainty expression in clinical AI systems

Houston, Texas | May 2025 - September 2025

Research Assistant for Dr. Hayden

Baylor College of Medicine - Hayden Lab

  • Transcribing and analyzing 4000 verbal responses from epilepsy patients performing arithmetic tasks to study cognitive-linguistic processing
  • Annotated and extracted acoustic features in Praat, enabling precise prosody analysis for epilepsy patient study
  • Transcribed morphemes and syntax using LLMs to analyze the neuro system and improved model by 12%

Dallas, Texas | July 2024 - February 2025

Research Assistant, Co-Author w/ Dr. Labiba

Southern Methodist University - CS Department

  • Published to CoNLL-2025 on socioeconomic status prediction using Reddit data and linguistic indicators
  • Engineered NLP pipelines with Logistic Regression, Naive Bayes, and transformer models for interpretable classification
  • Preprocessed and analyzed 30,000+ Reddit entries with NLTK and SpaCy using n-grams, sentiment, and LDA topic modeling

Southlake, Texas | September 2024 - February 2025

AI Software Engineering Intern

Product Manager Accelerator

  • Designed a college match platform using machine learning, speech-to-text, and a Siri-style accessibility interface
  • Delivered a top-3 final product pitch through a live demo and roadmap presentation

Remote | September 2024 - Present

Machine Learning Intern

ANIML.HEALTH - Project BALTO

  • Collaborating on advanced ML models for predicting animal health and improving veterinary diagnostics
  • Applied NLP methods to medical literature and health records for data-driven clinical strategy
  • Performed data cleaning, preprocessing, and feature engineering to improve predictive accuracy

Southlake, Texas | 2022 - May 2025

Designer & Coder

VEX Robotics - Southlake Carroll Chapter

  • Progressed from primary C++ coder to design specialist for pneumatics, catapult, and robot architecture
  • Maintained engineering notebook documenting design strategy and implementation that supported design award wins
  • Ranked top 10 in tournaments during 11th grade and qualified for VEX Robotics World Championship

Skills

Programming

PythonNumPyPandasscikit-learnPyTorchTensorFlowKerasGit/GitHub

NLP & AI

NLTKspaCyWord2VecBERTHugging FaceLDATopic ModelingSentiment Analysis

Machine Learning

Logistic RegressionNaive BayesSVMRandom ForestText ClassificationFeature Engineering

Education

University of Texas at Austin

Expected May 2028

B.S. Computer Science + Linguistics

Austin, Texas

LLMs, Object-Oriented Programming, Language and Computers

Carroll Senior High School

Graduated May 2025

High School Diploma

Southlake, Texas

AP Computer Science A/Principles, AP Calculus AB/BC, AP Statistics

Projects

Contact

Open to research collaborations, internships, and AI/ML opportunities.