Skip to content

CV

Download CV as PDF

🎓 Education

University of Cambridge, Cambridge, UK
Master’s in Advanced Computer Science
Oct. 2024 – July 2025

Texas A&M University, College Station, TX
BS in Computer Science · Minor in Economics
Overall GPA: 4.0/4.0
Honors Fellows · Engineering Honors · Undergraduate Research Scholar
Aug. 2020 – May 2024


💼 Industry Experience

Driveline Baseball, Kent, WA
Data Engineering Intern
May 2023 – Aug. 2023
- Developed and deployed pipeline to extract and upload athlete training data from FTP server to MySQL database
- Built internal Slack applications integrating the Zoho REST API using Python and the Slack SDK

Oklahoma City Thunder, Oklahoma City, OK
Software Engineering Intern
May 2022 – Dec. 2022
- Designed ETL pipeline to ingest NBA statistics for internal research using Python, PostgreSQL, and JSON, reducing storage usage by 97%
- Implemented Postgres functions for analysts to interface with data


📑 Publications & Theses

  • Thorat, S. (2025). DACTYL: Diverse Adversarial Corpus of Texts Yielded from Large Language Models. Master’s thesis, University of Cambridge.
  • Thorat, S. and Yang, T. (2024). Which LLMs are Difficult to Detect? A Detailed Analysis of Potential Factors Contributing to Difficulties in LLM Text Detection. In NeurIPS Safe Generative AI Workshop 2024.
  • Delanoy, G., Lupardus, C., Vali, S. W., Wofford, J. D., Thorat, S., and Lindahl, P. A. (2024). Mössbauer and EPR detection of iron trafficking kinetics and possibly labile iron pools in whole Saccharomyces cerevisiae cells. Journal of Biological Chemistry, 300(9).
  • Thorat, S. (2024). Are AI-Generated Texts Detectable? An Experimental Study using the LibAUC Library. Bachelor’s thesis, Texas A&M University.

  • Thorat, S., Walton, J. R., and Lindahl, P. A. (2023). A kinetic model of iron trafficking in growing Saccharomyces cerevisiae cells; applying mathematical methods to minimize the problem of sparse data and generate viable autoregulatory mechanisms. PLOS Computational Biology, 19(12):e1011701.


🏆 Awards

Cambridge-McKinsey Risk PrizeJune 2025
- 1st place for essay on risk management regarding AI-generated text detection

Senior Capstone Team Project — Cyber Canary (sponsored by Lockheed Martin)
Aug. 2023 – Dec. 2023
- Built ETL pipeline and downloadable vulnerability report
- Awarded 1st Place at Fall 2023 CSE Capstone Expo
- Developed desktop application for identifying vulnerabilities in JS projects

President’s Endowed Scholarship — Texas A&M University
Aug. 2020 – May 2024
- Full tuition scholarship awarded to National Merit Finalist scholars based on academic and leadership merit


🛠 Skills

Selected Coursework
- Undergraduate: Database Systems, Artificial Intelligence, Machine Learning, Data Analytics for Cybersecurity, Mathematical Economics, Macro/Microeconomic Theory
- Graduate: Natural Language Processing, Reinforcement Learning, The Future of LLMs (Data, Architectures, Ethics)

Technologies
Python · Java · C++ · Wolfram Mathematica · SQL · PostgreSQL · MySQL · Django · JavaScript · TypeScript · Linux · Android Development · AWS (S3, Lambda) · NumPy · pandas · PyTorch · scikit-learn · React · Angular · REST APIs · GitHub · GitLab · CI/CD (GitHub, GitLab) · R · R Shiny · LoRA fine-tuning (PEFT) · transformers · Plotly · matplotlib · seaborn · Streamlit