A graduate class · CSCI 655 · Fall 2025

Generative AI for the making of software.

Fifteen graduate scholars spent twelve weeks at the College of William & Mary learning how code-aware language models actually behave: where they hallucinate, how to measure their failures, and how to build workflows that don't collapse in production. Eleven projects shipped this spring.

Cohort: 15

Capstones: 11 / 11

Instructor: Dr. Mastropaolo

Modules: IX

Labs: III

Shipped: 11

§ I The Playbook · An overview of the course structure

Seven core chapters of theory and practice, plus two extras.

Each chapter is an overview of the topics studied in detail across lectures, lab handouts, and seminar discussions.

Chapter I · Wk 01-02

Lab A · shipped

Mining repositories

Collecting, cleaning, and tokenizing source-code data from public repositories. PyDriller, BPE, deduplication with MinHash, and the ethics of training on copyleft code.

repositories · BPE · ASTs · licensing

Input: public repos

Output: tokenized corpus

~20 min read
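As an illustration of the MinHash deduplication named in Chapter I, here is a minimal sketch in plain Python. It is not taken from the course materials: the helper names (`shingles`, `minhash_signature`, `estimated_jaccard`), the shingle size, and the number of hash functions are all assumptions chosen for the example, and salted BLAKE2b stands in for a family of hash functions.

```python
import hashlib

def shingles(tokens, k=3):
    """Return the set of k-token shingles (overlapping windows) for one file."""
    last_start = max(1, len(tokens) - k + 1)
    return {" ".join(tokens[i:i + k]) for i in range(last_start)}

def minhash_signature(items, num_hashes=64):
    """Keep the minimum hash value per salted hash function (illustrative setup)."""
    signature = []
    for seed in range(num_hashes):
        # A different salt behaves like a different hash function.
        salt = seed.to_bytes(8, "little")
        signature.append(min(
            int.from_bytes(
                hashlib.blake2b(item.encode("utf-8"), digest_size=8, salt=salt).digest(),
                "big",
            )
            for item in items
        ))
    return signature

def estimated_jaccard(sig_a, sig_b):
    """Fraction of matching positions, which estimates Jaccard similarity."""
    matches = sum(a == b for a, b in zip(sig_a, sig_b))
    return matches / len(sig_a)

# Toy token streams, invented for this sketch.
original = "def add ( a , b ) : return a + b".split()
unrelated = "import os ; print ( os . getcwd ( ) )".split()

sig_original = minhash_signature(shingles(original))
sig_copy = minhash_signature(shingles(list(original)))
sig_unrelated = minhash_signature(shingles(unrelated))
```

Comparing `sig_original` with `sig_copy` yields a similarity of 1.0, while files with no shared shingles score near zero. A deduplication pass would drop files whose estimated similarity exceeds a chosen threshold; that threshold is a tuning decision this sketch leaves open.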

Chapter II · Wk 03

Pre-lab · spam

Modeling code

From n-grams to the naturalness hypothesis. Probability refresher, MLE, perplexity, smoothing, sampling temperature — and why code is more predictable than English.

MLE · perplexity · smoothing · sampling

Input: token streams

Output: spam classifier

~45 min read
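To make the MLE, smoothing, and perplexity terms from Chapter II concrete, the sketch below trains a unigram model with additive smoothing and scores a token stream. It is a toy under stated assumptions: the function names, the four-symbol vocabulary, and the choice of a unigram model are illustrative and not drawn from the course handouts.

```python
import math
from collections import Counter

def train_unigram(tokens, vocab, alpha=1.0):
    """Additively smoothed unigram estimates; alpha=0 gives the plain MLE."""
    counts = Counter(tokens)
    total = len(tokens) + alpha * len(vocab)
    return {tok: (counts[tok] + alpha) / total for tok in vocab}

def perplexity(model, tokens):
    """exp of the average negative log-probability per token."""
    log_prob = sum(math.log(model[tok]) for tok in tokens)
    return math.exp(-log_prob / len(tokens))

# Hypothetical vocabulary and training streams.
vocab = {"a", "b", "c", "d"}
uniform = train_unigram(["a", "b", "c", "d"], vocab)
skewed = train_unigram(["a", "a", "a", "b"], vocab)

uniform_ppl = perplexity(uniform, ["a", "b", "c", "d"])
skewed_ppl = perplexity(skewed, ["a", "a", "a", "b"])
```

A uniform model over four symbols has perplexity 4, the number of equally likely choices. The skewed stream is more repetitive, so its model scores a lower perplexity on it, which is the sense in which a more predictable corpus is "more natural" to a language model. Smoothing keeps unseen tokens such as `"c"` at a nonzero probability, so the logarithm stays defined.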

Chapter III · Wk 04

Lab B · shipped

Evaluating rigorously

Classification metrics, BLEU and its discontents, CodeBLEU, pass@k, embeddings, SIDE, and the unglamorous human-evaluation rubric the best papers include without making a show of it.

BLEU · pass@k · CodeBLEU · SIDE
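Of the metrics listed for Chapter III, pass@k is the easiest to show in a few lines. The sketch below uses the commonly cited unbiased estimator, 1 - C(n - c, k) / C(n, k), where n samples are drawn per problem and c of them pass the tests. The function name and the sample counts are assumptions for illustration, not values from the course.

```python
import math

def pass_at_k(n, c, k):
    """Estimate the chance that at least one of k samples passes.

    n: samples generated for the problem
    c: samples that passed the unit tests
    k: sampling budget being evaluated
    """
    if not 0 <= c <= n:
        raise ValueError("c must be between 0 and n")
    if not 1 <= k <= n:
        raise ValueError("k must be between 1 and n")
    # math.comb returns 0 when k > n - c, so the estimate becomes 1.0.
    return 1.0 - math.comb(n - c, k) / math.comb(n, k)

# Hypothetical run: 10 samples, 3 of which pass.
p1 = pass_at_k(10, 3, 1)
p5 = pass_at_k(10, 3, 5)
```

With k = 1 the estimate reduces to the plain pass rate c / n. Larger k rewards models that succeed occasionally, which is why reports usually state the k alongside the score.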

Chapter IV

Deep learning

Neural networks, backpropagation, embeddings, LSTM/GRU, attention, transformers, autoregressive generation, pre-training, and fine-tuning — the engine room.

neural nets · LSTM · transformers · CodeBERT

Input: code tokens

Output: completion model

~60 min read

Chapter V · Wk 07-08

Lab C · shipped

Prompting LLMs

In-context learning, few-shot, chain-of-thought, prompt engineering, RAG, tool use, context-window management, prompt chaining, self-consistency, and prompt evaluation.

ICL · CoT · RAG · tool use

Input: natural prompts

Output: grounded answers

~50 min read

Chapter VI · Wk 09

Workshop · red-team

Hallucinations in code

How LLMs fabricate, the CodeHalu taxonomy, RAG mitigation, prompt defenses, tool-augmented generation, production case studies, and hallucination-resistant workflows.

CodeHalu · RAG mitigation · production cases

Input: model output

Output: red-team report

~50 min read

Chapter VII · Wk 10

Workshop · agents

Vibe coding & agents

Iterative human-LLM coding loops, autonomous coding agents with tool use, MCP, structured outputs, agent harnesses, evaluation under autonomy, and the practical patterns for shipping production code via natural-language workflows.

vibe coding · agents · MCP · tool use

Input: human intent

Output: shipped code

~55 min read

+ Extras Beyond the core seven — optional companions to the capstone

Extra + · Wk 11

Theory companion

NP-completeness

Reductions, hardness, and what LLMs do when the underlying problem isn't tractable. Where statistical pattern-matching collides with the unforgiving floor of computational complexity.

reductions · SAT · hardness · complexity

Input: hard problem

Output: honest limits

~30 min read

Extra + · Wk 12

Capstone-adjacent

Genetic algorithms

Population search, fitness landscapes, crossover, fitness approximation with LLM predictors, the GA+LLM architecture, and the honest limits of evolutionary search over code.

population · selection · fitness approx · GA + LLM

Input: candidate pool

Output: evolved solution

~45 min read

§ II The Cohort

Fifteen students.

The Fall 2025 roster of CSCI 655 — fifteen students across eleven capstone groups, sorted by student number. Open the cohort page for full profiles and per-student capstones.

Aidan Basloe · Group 01 · Stock AI

Sam Bennett · Group 05 · BURT++

Nathaniel Callabresi · Group 02 · Video Index

Alan Gonzalez Osorio · Group 03 · Arbitrage

James He · Group 04 · PlotForge

Walker Hyman · Group 11 · Job Search

Alice Ji · Group 06 · Claim Verify

Jeff Lin · Group 01 · Stock AI

Abby Schwall · Group 07 · Degree Map

Jack Stawasz · Group 04 · PlotForge

Krishna Swaminathan · Group 08 · RAG Rules

Yibarek Tadesse · Group 09 · CodeCaster

Camly Tran · Group 06 · Claim Verify

Lily Walker · Group 02 · Video Index

Carter Williamson · Group 10 · Youth Sports

§ III Final Projects

Eleven capstones, nine domains.

Each group chose a real problem, scoped a system, and built something that runs. Every capstone was reviewed against five rubric criteria: market analysis, differentiation, technical framework, vibe-coding, and live demo. All eleven groups shipped on schedule.

Group 01 Finance · markets

Stock Investment AI

An algorithmic stock-prediction interface with explainable retrieval-grounded recommendations. Pairs price-signal modeling with LLM-generated reasoning over filings.

Aidan · Jeff

Group 02 Search · multimedia

Multimodal Video Indexing

Natural-language search across video archives, replacing brittle metadata-only retrieval with vision-and-language embeddings indexed at scene granularity.

Nathaniel · Lily

Group 03 Sports · risk

Sports Betting Arbitrage

Real-time cross-sportsbook arbitrage detection with risk-aware position sizing. Surfaces price disagreements before they close.

Alan

Group 04 Tooling · data

PlotForge

A plotting interface for data analysis aimed at students, educators, and lightweight analysts. Natural-language to charting with iterative refinement.

James · Jack

Group 05 Dev tools · QA

BURT++

A bug-report assistant that translates non-technical user complaints into actionable engineering tickets — clarifying reproduction steps as it goes.

Sam

Group 06 Civic · verification

GenAI Claim Verification

Retrieval-augmented evidence pipeline for verifying factual claims, attaching source citations with calibrated confidence.

Alice · Camly

Group 07 Education · planning

W&M Degree Map

A planning tool for liberal-arts students navigating complex general-education requirements. Goal-aware course recommendations with clear-eyed prerequisite traversal.

Abby

Group 08 Sports · rules