A graduate class · CSCI 655 · Fall 2025

Generative AI for the making of software.

Fifteen graduate scholars spent twelve weeks at the College of William & Mary learning how code-aware language models actually behave: where they hallucinate, how to measure their failures, and how to build workflows that don't collapse in production. Eleven projects shipped this spring.

Cohort: 15
Capstones: 11 / 11
Instructor: Dr. Mastropaolo
Modules: IX
Labs: III
§ I The Playbook · An overview of the course structure

Seven chapters and two extras of theory and practice.

Each chapter summarizes the topics covered in detail across lectures, lab handouts, and seminar discussions.

Chapter I · Wk 01-02
Lab A · shipped

Mining repositories

Collecting, cleaning, and tokenizing source-code data from public repositories. PyDriller, BPE, deduplication with MinHash, and the ethics of training on copyleft code.

repositories · BPE · ASTs · licensing
Input: public repos
Output: tokenized corpus
~20 min read
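
The MinHash deduplication named in this chapter can be illustrated with a minimal sketch. It assumes whitespace tokenization, 3-token shingles, and seeded BLAKE2b hashes. All function names are hypothetical and are not taken from the course lab.

```python
import hashlib


def shingles(tokens, k=3):
    """Return the set of contiguous k-token windows in a token list."""
    if len(tokens) < k:
        return {tuple(tokens)}
    return {tuple(tokens[i:i + k]) for i in range(len(tokens) - k + 1)}


def _hash(shingle, seed):
    """Hash one shingle with a seeded 64-bit BLAKE2b digest."""
    digest = hashlib.blake2b(
        " ".join(shingle).encode("utf-8"),
        digest_size=8,
        salt=seed.to_bytes(8, "little"),  # the seed selects the hash function
    ).digest()
    return int.from_bytes(digest, "big")


def minhash_signature(shingle_set, num_hashes=64):
    """Keep the minimum hash value under each of num_hashes seeded hashes."""
    return [min(_hash(s, seed) for s in shingle_set) for seed in range(num_hashes)]


def estimated_jaccard(sig_a, sig_b):
    """Fraction of matching signature slots, an estimate of Jaccard similarity."""
    return sum(a == b for a, b in zip(sig_a, sig_b)) / len(sig_a)


# Illustrative snippets (assumed data): b is a near-duplicate of a, c is unrelated.
a = "def add(x, y): return x + y".split()
b = "def add(x, y): return x + y  # sum".split()
c = "class Stack: pass".split()

sig_a, sig_b, sig_c = (minhash_signature(shingles(t)) for t in (a, b, c))
print(estimated_jaccard(sig_a, sig_b), estimated_jaccard(sig_a, sig_c))
```

A deduplication pass would then drop one file from any pair whose estimated similarity exceeds a chosen threshold. The threshold itself is a tuning decision not specified here.
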
Chapter II · Wk 03
Pre-lab · spam

Modeling code

From n-grams to the naturalness hypothesis. Probability refresher, MLE, perplexity, smoothing, sampling temperature — and why code is more predictable than English.

MLE · perplexity · smoothing · sampling
Input: token streams
Output: spam classifier
~45 min read
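
As a small illustration of the MLE, smoothing, and perplexity topics in this chapter, the sketch below fits a bigram model with add-one (Laplace) smoothing and scores two token sequences. The smoothing choice, the toy data, and the function names are assumptions for illustration. The sketch also assumes every scored token appears in the training vocabulary.

```python
import math
from collections import Counter


def train_bigram(tokens):
    """Count contexts and bigrams; return them with the vocabulary."""
    contexts = Counter(tokens[:-1])
    bigrams = Counter(zip(tokens, tokens[1:]))
    return contexts, bigrams, set(tokens)


def bigram_prob(prev, cur, contexts, bigrams, vocab_size):
    """Add-one smoothed estimate of P(cur | prev)."""
    return (bigrams[(prev, cur)] + 1) / (contexts[prev] + vocab_size)


def perplexity(tokens, contexts, bigrams, vocab):
    """exp of the average negative log-probability per predicted token."""
    pairs = list(zip(tokens, tokens[1:]))
    if not pairs:
        raise ValueError("need at least two tokens to score a bigram model")
    log_prob = sum(
        math.log(bigram_prob(p, c, contexts, bigrams, len(vocab))) for p, c in pairs
    )
    return math.exp(-log_prob / len(pairs))


# Toy training stream (assumed data): a strictly alternating pattern.
model = train_bigram("a b a b a b a b".split())
regular = perplexity("a b a b".split(), *model)
irregular = perplexity("a a b b".split(), *model)
print(regular, irregular)
```

The sequence that follows the training pattern scores a lower perplexity than the one that breaks it. This is the sense in which a more predictable corpus is easier to model.
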
Chapter III · Wk 04
Lab B · shipped

Evaluating rigorously

Classification metrics, BLEU and its discontents, CodeBLEU, pass@k, embeddings, SIDE, and the unglamorous human-evaluation rubric the best papers include without making a show of it.

BLEU · pass@k · CodeBLEU · SIDE
Input: model outputs
Output: eval rubric
~50 min read
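
For pass@k, a commonly used unbiased estimator takes n sampled completions per problem, of which c pass the tests, and computes 1 - C(n-c, k) / C(n, k). The sketch below implements that formula. The function name and the sample counts are illustrative, and the course lab may compute the metric differently.

```python
from math import comb


def pass_at_k(n, c, k):
    """Estimate the probability that at least one of k samples passes.

    n: total completions sampled for the problem
    c: number of those completions that passed the tests
    k: sample budget being evaluated (k <= n)
    """
    if not 0 <= c <= n or not 1 <= k <= n:
        raise ValueError("require 0 <= c <= n and 1 <= k <= n")
    # comb(n - c, k) is 0 when fewer than k samples failed, giving 1.0.
    return 1.0 - comb(n - c, k) / comb(n, k)


# Illustrative counts (assumed): 10 samples, 3 of them correct.
for k in (1, 5, 10):
    print(k, pass_at_k(10, 3, k))
```

Averaging this value over all problems in a benchmark gives the reported pass@k score.
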
Chapter IV · Wk 05-06
Lab · completion

Deep learning

Neural networks, backpropagation, embeddings, LSTM/GRU, attention, transformers, autoregressive generation, pre-training, and fine-tuning — the engine room.

neural nets · LSTM · transformers · CodeBERT
Input: code tokens
Output: completion model
~60 min read
Chapter V · Wk 07-08
Lab C · shipped

Prompting LLMs

In-context learning, few-shot, chain-of-thought, prompt engineering, RAG, tool use, context-window management, prompt chaining, self-consistency, and prompt evaluation.

ICL · CoT · RAG · tool use
Input: natural prompts
Output: grounded answers
~50 min read
Chapter VI · Wk 09
Workshop · red-team

Hallucinations in code

How LLMs fabricate, the CodeHalu taxonomy, RAG mitigation, prompt defenses, tool-augmented generation, production case studies, and hallucination-resistant workflows.

CodeHalu · RAG mitigation · production cases
Input: model output
Output: red-team report
~50 min read
Chapter VII · Wk 10
Workshop · agents

Vibe coding & agents

Iterative human-LLM coding loops, autonomous coding agents with tool use, MCP, structured outputs, agent harnesses, evaluation under autonomy, and the practical patterns for shipping production code via natural-language workflows.

vibe coding · agents · MCP · tool use
Input: human intent
Output: shipped code
~55 min read
Extra · Wk 11
Theory companion

NP-completeness

Reductions, hardness, and what LLMs do when the underlying problem isn't tractable. Where statistical pattern-matching collides with the unforgiving floor of computational complexity.

reductions · SAT · hardness · complexity
Input: hard problem
Output: honest limits
~30 min read
Extra · Wk 12
Capstone-adjacent

Genetic algorithms

Population search, fitness landscapes, crossover, fitness approximation with LLM predictors, the GA+LLM architecture, and the honest limits of evolutionary search over code.

population · selection · fitness approx · GA + LLM
Input: candidate pool
Output: evolved solution
~45 min read
§ II The Cohort

Fifteen students.

The Fall 2025 roster of CSCI 655 — fifteen students across eleven capstone groups, sorted by student number. Open the cohort page for full profiles and per-student capstones.

Meet the cohort →

§ III Final Projects

Eleven capstones, nine domains.

Each group chose a real problem, scoped a system, and built something that runs. Projects were reviewed against five rubric criteria: market analysis, differentiation, technical framework, vibe-coding, and live demo. All eleven groups shipped on schedule.

Group 01 Finance · markets

Stock Investment AI

An algorithmic stock-prediction interface with explainable retrieval-grounded recommendations. Pairs price-signal modeling with LLM-generated reasoning over filings.

Aidan · Jeff
Group 02 Search · multimedia

Multimodal Video Indexing

Natural-language search across video archives, replacing brittle metadata-only retrieval with vision-and-language embeddings indexed at scene granularity.

Nathaniel · Lily
Group 03 Sports · risk

Sports Betting Arbitrage

Real-time cross-sportsbook arbitrage detection with risk-aware position sizing. Surfaces price disagreements before they close.

Alan
Group 04 Tooling · data

PlotForge

A plotting interface for data analysis aimed at students, educators, and lightweight analysts. Natural-language to charting with iterative refinement.

James · Jack
Group 05 Dev tools · QA

BURT++

A bug-report assistant that translates non-technical user complaints into actionable engineering tickets — clarifying reproduction steps as it goes.

Sam
Group 06 Civic · verification

GenAI Claim Verification

Retrieval-augmented evidence pipeline for verifying factual claims, attaching source citations with calibrated confidence.

Alice · Camly
Group 07 Education · planning

W&M Degree Map

A planning tool for liberal-arts students navigating complex general-education requirements. Goal-aware course recommendations with clear-eyed prerequisite traversal.

Abby
Group 08 Sports · rules

RAG Rules · Ultimate Frisbee

A retrieval-augmented rules interpreter for self-officiated Ultimate Frisbee. Answers in-game questions by grounding in the official rulebook.

Krishna
Group 09 Education · code

CodeCaster

A coding assistant for social-science students learning to program for data analysis. Designed for the first hundred lines, not the next thousand.

Yibarek
Group 10 Civic · sports

Youth Sports Registration

A multilingual youth-sports registration platform with a serious focus on accessibility, built to reach families that current platforms exclude.

Carter
Group 11 Career · jobs

AI-Powered Job Search

A unified career platform consolidating fragmented job-seeker tooling into one assistant — resume, search, outreach, and prep in a single workflow.

Walker