CV¶
🎓 Education¶
University of Cambridge, Cambridge, UK
Master’s in Advanced Computer Science
Oct. 2024 – July 2025
Texas A&M University, College Station, TX
BS in Computer Science · Minor in Economics
Overall GPA: 4.0/4.0
Honors Fellows · Engineering Honors · Undergraduate Research Scholar
Aug. 2020 – May 2024
💼 Industry Experience¶
Driveline Baseball, Kent, WA
Data Engineering Intern
May 2023 – Aug. 2023
- Developed and deployed pipeline to extract and upload athlete training data from FTP server to MySQL database
- Built internal Slack applications integrating the Zoho REST API using Python and the Slack SDK
Oklahoma City Thunder, Oklahoma City, OK
Software Engineering Intern
May 2022 – Dec. 2022
- Designed ETL pipeline to ingest NBA statistics for internal research using Python, PostgreSQL, and JSON, reducing storage usage by 97%
- Implemented Postgres functions for analysts to interface with data
📑 Publications & Theses¶
- Thorat, S. (2025). DACTYL: Diverse Adversarial Corpus of Texts Yielded from Large Language Models. Master’s thesis, University of Cambridge.
- Thorat, S. and Yang, T. (2024). Which LLMs are Difficult to Detect? A Detailed Analysis of Potential Factors Contributing to Difficulties in LLM Text Detection. In NeurIPS Safe Generative AI Workshop 2024.
- Delanoy, G., Lupardus, C., Vali, S. W., Wofford, J. D., Thorat, S., and Lindahl, P. A. (2024). Mössbauer and EPR detection of iron trafficking kinetics and possibly labile iron pools in whole Saccharomyces cerevisiae cells. Journal of Biological Chemistry, 300(9).
-
Thorat, S. (2024). Are AI-Generated Texts Detectable? An Experimental Study using the LibAUC Library. Bachelor’s thesis, Texas A&M University.
-
Thorat, S., Walton, J. R., and Lindahl, P. A. (2023). A kinetic model of iron trafficking in growing Saccharomyces cerevisiae cells; applying mathematical methods to minimize the problem of sparse data and generate viable autoregulatory mechanisms. PLOS Computational Biology, 19(12):e1011701.
🏆 Awards¶
Cambridge-McKinsey Risk Prize — June 2025
- 1st place for essay on risk management regarding AI-generated text detection
Senior Capstone Team Project — Cyber Canary (sponsored by Lockheed Martin)
Aug. 2023 – Dec. 2023
- Built ETL pipeline and downloadable vulnerability report
- Awarded 1st Place at Fall 2023 CSE Capstone Expo
- Developed desktop application for identifying vulnerabilities in JS projects
President’s Endowed Scholarship — Texas A&M University
Aug. 2020 – May 2024
- Full tuition scholarship awarded to National Merit Finalist scholars based on academic and leadership merit
🛠 Skills¶
Selected Coursework
- Undergraduate: Database Systems, Artificial Intelligence, Machine Learning, Data Analytics for Cybersecurity, Mathematical Economics, Macro/Microeconomic Theory
- Graduate: Natural Language Processing, Reinforcement Learning, The Future of LLMs (Data, Architectures, Ethics)
Technologies
Python · Java · C++ · Wolfram Mathematica · SQL · PostgreSQL · MySQL · Django · JavaScript · TypeScript · Linux · Android Development · AWS (S3, Lambda) · NumPy · pandas · PyTorch · scikit-learn · React · Angular · REST APIs · GitHub · GitLab · CI/CD (GitHub, GitLab) · R · R Shiny · LoRA fine-tuning (PEFT) · transformers · Plotly · matplotlib · seaborn · Streamlit