Chess Cheat Detection: Analyzing PGN Files

An automated framework for detecting potential cheating in chess games through statistical analysis of Portable Game Notation (PGN) files, developed as part of an MTech dissertation in AI & ML at BITS Pilani.

Project Overview

This project addresses the growing concern of chess cheating in online platforms by analyzing gameplay patterns using centipawn loss (CPL) metrics and machine learning techniques. The system identifies statistical anomalies that may indicate computer assistance during gameplay.

Key Features

PGN File Analysis: Parse and analyze chess games in standard PGN format
Centipawn Loss Calculation: Evaluate move quality against optimal engine recommendations
Statistical Pattern Detection: Identify suspicious gameplay patterns and anomalies
Data Visualization: Generate comprehensive graphs and charts for pattern analysis
Machine Learning Classification: 95% accuracy achieved using Neural Network models
Multi-factor Analysis: Consider game length, move timing, and contextual factors

Technologies Used

Python 3.x
Chess Libraries: python-chess, chess.pgn
Engine Analysis: Stockfish 16 (depth 20)
Data Processing: pandas, numpy
Visualization: matplotlib, seaborn
Machine Learning: scikit-learn, tensorflow

Detection Methodology

Primary Indicators

Consistently low CPL across complex positions
Unnatural CPL stability throughout games
High alignment with top engine choices in critical positions
Performance incongruent with player rating

Secondary Indicators

Unusual game length patterns
Inconsistent time usage patterns
Performance spikes in specific periods
Drastic improvement without corresponding rating increase

Classification Confidence Levels

Low Confidence: Only secondary indicators present
Medium Confidence: One primary + multiple secondary indicators
High Confidence: Multiple primary indicators present

Installation & Setup

Clone the repository

git clone https://github.com/0xafraidoftime/Chess-Cheat-Detection.git
cd Chess-Cheat-Detection

Install required dependencies

pip install python-chess pandas matplotlib seaborn scikit-learn tensorflow stockfish

Download Stockfish Engine

Download Stockfish 16 from official website
Ensure it's accessible in your system PATH

Project Structure

chess-cheat-detection/
├── chess_engine.py          # Basic PGN parsing functionality
├── chess_engine2.py         # Enhanced analysis with CPL extraction
├── CPL plot.py             # Visualization and plotting utilities
├── engine3.py              # Additional analysis features
├── game length distribution.py  # Game length analysis
├── data/
│   ├── a.pgn               # Sample PGN files
│   ├── game.pgn
│   ├── new_game.pgn
│   └── annotated_game2.pgn
├── analysis/
│   ├── *.csv               # Generated analysis results
│   └── pgn_analysis.csv    # Summary statistics
└── README.md

Usage

Basic PGN Analysis

from chess_engine import parse_pgn

# Parse a PGN file
moves = parse_pgn("your_game.pgn")
print("Extracted moves:", moves)

Centipawn Loss Analysis

# Run CPL analysis and generate visualizations
python "CPL plot.py"

Game Length Distribution

# Analyze game length patterns
python "game length distribution.py"

Cheat Detection

from chess_engine2 import analyze_game

# Analyze a game for potential cheating
analyze_game("suspicious_game.pgn")

Performance Metrics

Algorithm	Accuracy	Precision	Recall	F1-Score
Decision Tree	85%	88%	83%	85%
Random Forest	92%	90%	93%	91%
Neural Network	95%	94%	96%	95%

Key Findings

CPL Analysis: Effectively differentiates fair from engine-assisted play
Game Context: Complexity, phase, and time control are crucial for accurate detection
Multi-factorial Approach: Reduces false positives significantly
Pattern Recognition: Identifies bimodal distribution in game lengths for suspicious accounts

Limitations

Requires annotated PGN files or Stockfish integration for evaluation
May not detect sophisticated cheating (intentional suboptimal moves)
Dataset size impacts ML model accuracy
Computational intensity for real-time analysis
Potential false positives with very strong human players

Future Enhancements

Technical Improvements

Real-time analysis capability for tournament monitoring
Cloud-based implementation for scalability
Mobile application for arbiters and tournament directors
Integration with chess platform APIs

Research Extensions

Advanced neural network architectures (LSTM, Transformer)
Incorporation of psychological factors and playing style analysis
Application to other strategic games (Go, Shogi)
Self-improving detection through continuous learning

Research Background

This project was developed as part of an MTech dissertation in AI & ML at BITS Pilani under the supervision of Milin Shah (VP & Technology Manager, Bank of America). The research addresses the increasing prevalence of chess cheating in online environments, particularly since 2020.

Contributing

Contributions are welcome! Please feel free to submit issues, feature requests, or pull requests to improve the detection algorithms and expand the project's capabilities.

Citation

If you use this work in your research, please cite:

Pal, A. (2025). Cheat Detection in Chess: Analyzing PGN Files. 
MTech Dissertation, BITS Pilani. Supervisor: Milin Shah.

License

This project is open-source and available under the MIT License. See LICENSE file for details.

Acknowledgments

Supervisor: Milin Shah, Bank of America
Institution: BITS Pilani Department of AI & ML
Chess Community Contributors: Manan Pahwa (Purdue University), Moin Memon (Bank of America)
Technical Support: Stockfish Development Team, Python-chess Library Contributors
Data Sources: Chess.com Research Dataset, Lichess Open Database

Contact

Ankita Pal
BITS ID: 2022AC05327
Email: [email protected]

This project aims to maintain competitive integrity in chess while providing transparent, objective methods for cheat detection that can assist tournament arbiters and online platforms.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
2022AC05327_MEF.docx		2022AC05327_MEF.docx
2022AC05327_Mentor Evaluation.pdf		2022AC05327_Mentor Evaluation.pdf
2022ac05327.pdf		2022ac05327.pdf
2022ac05327_final.docx		2022ac05327_final.docx
AUTHORS		AUTHORS
Advanced-topics.md		Advanced-topics.md
CITATION.cff		CITATION.cff
CONTRIBUTING.md		CONTRIBUTING.md
CPL plot.py		CPL plot.py
Chess_Dissertation_Mid Semester.docx		Chess_Dissertation_Mid Semester.docx
Compiling-from-source.md		Compiling-from-source.md
Copying.txt		Copying.txt
Developers.md		Developers.md
Dissertation.pdf		Dissertation.pdf
Dissertation.pptx		Dissertation.pptx
Download-and-usage.md		Download-and-usage.md
Governance-and-responsibilities.md		Governance-and-responsibilities.md
Home.md		Home.md
Makefile		Makefile
Outline - Dissertation.docx		Outline - Dissertation.docx
README.md		README.md
Regression-Tests.md		Regression-Tests.md
Stockfish-FAQ.md		Stockfish-FAQ.md
Terminology.md		Terminology.md
UCI-&-Commands.md		UCI-&-Commands.md
UNLICENCE		UNLICENCE
Useful-data.md		Useful-data.md
_Footer.md		_Footer.md
a.pgn		a.pgn
a.pgn_cpl_analysis.csv		a.pgn_cpl_analysis.csv
affine_transform.h		affine_transform.h
affine_transform_sparse_input.h		affine_transform_sparse_input.h
annotated_game.pgn		annotated_game.pgn
annotated_game.pgn_cpl_analysis.csv		annotated_game.pgn_cpl_analysis.csv
annotated_game2.pgn		annotated_game2.pgn
annotated_game2.pgn_cpl_analysis.csv		annotated_game2.pgn_cpl_analysis.csv
benchmark.cpp		benchmark.cpp
benchmark.h		benchmark.h
bitboard.cpp		bitboard.cpp
bitboard.h		bitboard.h
chess_engine.py		chess_engine.py
chess_engine2.py		chess_engine2.py
clipped_relu.h		clipped_relu.h
engine.cpp		engine.cpp
engine.h		engine.h
engine3.py		engine3.py
evaluate.cpp		evaluate.cpp
evaluate.h		evaluate.h
game length distribution.py		game length distribution.py
game.pgn		game.pgn
game.pgn_cpl_analysis.csv		game.pgn_cpl_analysis.csv
half_ka_v2_hm.cpp		half_ka_v2_hm.cpp
half_ka_v2_hm.h		half_ka_v2_hm.h
incbin.h		incbin.h
main.cpp		main.cpp
memory.cpp		memory.cpp
memory.h		memory.h
misc.cpp		misc.cpp
misc.h		misc.h
movegen.cpp		movegen.cpp
movegen.h		movegen.h
movepick.cpp		movepick.cpp
movepick.h		movepick.h
network.cpp		network.cpp
network.h		network.h
new_game.pgn		new_game.pgn
new_game.pgn_cpl_analysis.csv		new_game.pgn_cpl_analysis.csv
nnue_accumulator.h		nnue_accumulator.h
nnue_architecture.h		nnue_architecture.h
nnue_common.h		nnue_common.h
nnue_feature_transformer.h		nnue_feature_transformer.h
nnue_misc.cpp		nnue_misc.cpp
nnue_misc.h		nnue_misc.h
numa.h		numa.h
perft.h		perft.h
pgn_analysis.csv		pgn_analysis.csv
position.cpp		position.cpp
position.h		position.h
score.cpp		score.cpp
score.h		score.h
search.cpp		search.cpp
search.h		search.h
simd.h		simd.h
sqr_clipped_relu.h		sqr_clipped_relu.h
tbprobe.cpp		tbprobe.cpp
tbprobe.h		tbprobe.h
testgame1.md		testgame1.md
thread.cpp		thread.cpp
thread.h		thread.h
thread_win32_osx.h		thread_win32_osx.h
timeman.cpp		timeman.cpp
timeman.h		timeman.h
tt.cpp		tt.cpp
tt.h		tt.h
tune.cpp		tune.cpp
tune.h		tune.h
types.h		types.h
uci.cpp		uci.cpp
uci.h		uci.h
ucioption.cpp		ucioption.cpp

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Chess Cheat Detection: Analyzing PGN Files

Project Overview

Key Features

Technologies Used

Detection Methodology

Primary Indicators

Secondary Indicators

Classification Confidence Levels

Installation & Setup

Project Structure

Usage

Basic PGN Analysis

Centipawn Loss Analysis

Game Length Distribution

Cheat Detection

Performance Metrics

Key Findings

Limitations

Future Enhancements

Technical Improvements

Research Extensions

Research Background

Contributing

Citation

License

Acknowledgments

Contact

About

Uh oh!

Releases

Packages

Languages

License

0xafraidoftime/Chess-Cheat-Detection

Folders and files

Latest commit

History

Repository files navigation

Chess Cheat Detection: Analyzing PGN Files

Project Overview

Key Features

Technologies Used

Detection Methodology

Primary Indicators

Secondary Indicators

Classification Confidence Levels

Installation & Setup

Project Structure

Usage

Basic PGN Analysis

Centipawn Loss Analysis

Game Length Distribution

Cheat Detection

Performance Metrics

Key Findings

Limitations

Future Enhancements

Technical Improvements

Research Extensions

Research Background

Contributing

Citation

License

Acknowledgments

Contact

About

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages