Codebase Structure
This section provides a comprehensive overview of the AZ-Go codebase organization, helping developers understand the architecture and locate specific functionality.
Directory Overview
AZ-Go/
├── configs/ # Configuration files
├── debug/ # Debug and testing utilities
├── distributed/ # Distributed training components
├── docs/ # Documentation (Jekyll site)
├── engine/ # Go engine implementation
├── go/ # Core Go game logic
├── gtp/ # Go Text Protocol interface
├── katago/ # KataGo integration for analysis
├── lifecycle/ # Training lifecycle management
├── logger/ # Logging utilities
├── neural_network/ # Neural network models
├── training/ # Training pipeline components
├── utils/ # General utilities
├── definitions.py # Common path definitions
├── mcts.py # Monte Carlo Tree Search implementation
├── start_main.py # Main training orchestrator
└── start_worker.py # Distributed worker process
Core Components
Game Logic (go/)
The fundamental Go game implementation:
- game.py - Base game interface
- go_game.py - Go-specific game implementation
- go_logic.py - Core game rules and logic
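For orientation, here is a minimal usage sketch of the game interface defined in go/. The import path, class name, and method names are illustrative assumptions and may not match the actual code.

```python
# Hypothetical usage of the Go game interface; names and signatures are
# assumptions for illustration, not the project's actual API.
from go.go_game import GoGame  # assumed import path

game = GoGame(9)                           # board size, e.g. taken from configs/config.yaml
board = game.getInitBoard()                # empty starting position
valid = game.getValidMoves(board, 1)       # legal-move mask for the player to move
board, player = game.getNextState(board, 1, action=0)
result = game.getGameEnded(board, player)  # nonzero once the game is decided
```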
Neural Network (neural_network/)
Deep learning models for position evaluation:
- neural_net.py - Base neural network interface
- neural_net_wrapper.py - PyTorch model wrapper
- go_neural_net.py - Go-specific neural network implementation
- go_alphanet.py - Main AlphaZero network architecture
- training_utils.py - Training utilities (Bar and AverageMeter)
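To make the AlphaZero-style architecture concrete, below is a minimal PyTorch sketch of a residual trunk with separate policy and value heads. Layer sizes, class names, and the single-plane input are assumptions; the actual definition lives in neural_network/go_alphanet.py.

```python
# Illustrative AlphaZero-style dual-head network; not the actual go_alphanet.py.
import torch
import torch.nn as nn
import torch.nn.functional as F

class ResidualBlock(nn.Module):
    def __init__(self, channels=64):
        super().__init__()
        self.conv1 = nn.Conv2d(channels, channels, 3, padding=1)
        self.bn1 = nn.BatchNorm2d(channels)
        self.conv2 = nn.Conv2d(channels, channels, 3, padding=1)
        self.bn2 = nn.BatchNorm2d(channels)

    def forward(self, x):
        out = F.relu(self.bn1(self.conv1(x)))
        out = self.bn2(self.conv2(out))
        return F.relu(out + x)                        # skip connection

class AlphaZeroNet(nn.Module):
    def __init__(self, board_size=9, channels=64, blocks=6):
        super().__init__()
        self.stem = nn.Sequential(
            nn.Conv2d(1, channels, 3, padding=1),     # single input plane (assumption)
            nn.BatchNorm2d(channels), nn.ReLU())
        self.trunk = nn.Sequential(*[ResidualBlock(channels) for _ in range(blocks)])
        n = board_size * board_size
        self.policy = nn.Linear(channels * n, n + 1)  # +1 for the pass move
        self.value = nn.Linear(channels * n, 1)

    def forward(self, x):
        h = self.trunk(self.stem(x)).flatten(1)
        return F.log_softmax(self.policy(h), dim=1), torch.tanh(self.value(h))
```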
Monte Carlo Tree Search (mcts.py)
The MCTS algorithm implementation for move selection, featuring the following (sketched in code after this list):
- Tree search with UCB exploration
- Virtual loss for parallel search
- Dirichlet noise for exploration
- Tree reuse between moves
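The UCB-style selection and Dirichlet-noise features can be summarized with a short, hedged sketch; the constants and data structures below are illustrative, not the actual mcts.py implementation.

```python
# PUCT-style selection with Dirichlet root noise (illustrative only).
import math
import numpy as np

def add_dirichlet_noise(priors, alpha=0.3, eps=0.25):
    """Mix Dirichlet noise into the root priors (a numpy array) for exploration."""
    noise = np.random.dirichlet([alpha] * len(priors))
    return (1 - eps) * priors + eps * noise

def puct_select(children, c_puct=1.5):
    """Pick the child maximizing Q + U, where U is the UCB-style exploration bonus.

    `children` maps an action to a dict with keys: visits, value_sum, prior.
    """
    total_visits = sum(child["visits"] for child in children.values())
    best_action, best_score = None, -float("inf")
    for action, child in children.items():
        q = child["value_sum"] / child["visits"] if child["visits"] else 0.0
        u = c_puct * child["prior"] * math.sqrt(total_visits + 1) / (1 + child["visits"])
        if q + u > best_score:
            best_action, best_score = action, q + u
    return best_action
```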
Debug Tools (debug/)
Utility scripts for testing and debugging:
- debug_self_play.py - Test self-play functionality
- debug_arena.py - Test arena competitions
- debug_scoring.py - Test game scoring
- debug_worker.py - Test worker connections
Training Pipeline (training/)
Components for the self-play training loop:
- overseer.py - Training overseer that manages iterations
- self_play_manager.py - Manages self-play game generation
- arena.py - Model comparison arena (class, not standalone script)
- arena_manager.py - Manages arena competitions
- worker.py - Distributed worker implementation
GTP Interface (gtp/)
Go Text Protocol implementation for GUI integration:
- engine.py - Main GTP engine interface
- engine_mcts.py - MCTS implementation for GTP engine
- heatmap_generator.py - Visual heatmap generation for move analysis
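As a rough illustration of what a GTP front end expects, here is a minimal read-eval loop over stdin/stdout. The command set and dispatch table are hypothetical simplifications; the real gtp/engine.py supports the full protocol.

```python
# Minimal GTP-style command loop (illustrative, not the project's engine).
import sys

def gtp_loop(genmove):
    """`genmove` maps a color ("b"/"w") to a vertex string such as "D4"."""
    handlers = {
        "protocol_version": lambda args: "2",
        "name": lambda args: "AZ-Go",
        "genmove": lambda args: genmove(args[0]),
        "quit": lambda args: "",
    }
    for line in sys.stdin:
        parts = line.split()
        if not parts:
            continue
        cmd, args = parts[0], parts[1:]
        handler = handlers.get(cmd)
        if handler is None:
            sys.stdout.write("? unknown command\n\n")        # GTP failure response
        else:
            sys.stdout.write("= " + handler(args) + "\n\n")  # GTP success response
        sys.stdout.flush()
        if cmd == "quit":
            break
```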
Distributed System (distributed/)
Infrastructure for distributed training:
- ssh_connector.py - SSH connection management
- status_manager.py - Worker status tracking
- status.txt - Worker status file
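The SSH layer can be pictured with a short paramiko sketch. Whether ssh_connector.py actually uses paramiko is an assumption, and the host, user, and command below are placeholders.

```python
# Launching a remote worker over SSH with paramiko (illustrative assumption).
import paramiko

client = paramiko.SSHClient()
client.set_missing_host_key_policy(paramiko.AutoAddPolicy())
client.connect("worker-host.example.com", username="azgo")  # placeholder host/user
stdin, stdout, stderr = client.exec_command("python start_worker.py")
print(stdout.read().decode())
client.close()
```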
Engine Interface (engine/)
Go engine implementation for playing against the model:
- engine.py - Main engine interface
- run_engine.py - Engine runner script (accepts -cli flag)
- engine_config.yaml - Engine configuration
- build_engine.sh - Engine build script (PyInstaller)
- README.md - Engine documentation
Analysis Tools (katago/)
Integration with KataGo for move analysis:
- katago_wrapper.py - KataGo interface wrapper
- katago_parameters.py - KataGo configuration
- run_katago.py - Analysis runner (hardcoded to analyze sgf/figure_c1.sgf)
- query_example.json - Example query format for KataGo
- response_example.json - Example response format from KataGo
- sgf/ - Sample game files for analysis
- results/ - Analysis results storage
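For context, KataGo's analysis engine accepts one JSON query per line on stdin and returns one JSON result per line on stdout. The sketch below follows that documented format; the launch command, config, and model paths are placeholders, and the project's own query_example.json is the authoritative reference for the fields it uses.

```python
# Sending a single analysis query to KataGo's analysis engine (paths are placeholders).
import json
import subprocess

query = {
    "id": "example",
    "moves": [["B", "D4"], ["W", "F5"]],
    "rules": "chinese",
    "komi": 7.5,
    "boardXSize": 9,
    "boardYSize": 9,
    "analyzeTurns": [2],
    "maxVisits": 100,
}

proc = subprocess.Popen(
    ["katago", "analysis", "-config", "analysis.cfg", "-model", "model.bin.gz"],
    stdin=subprocess.PIPE, stdout=subprocess.PIPE, text=True)
proc.stdin.write(json.dumps(query) + "\n")
proc.stdin.flush()
response = json.loads(proc.stdout.readline())  # analysis for the requested turn
```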
Entry Points
- start_main.py - Main training orchestrator (no command-line arguments)
- start_worker.py - Distributed worker process (no command-line arguments)
- engine/run_engine.py - Standalone Go engine (accepts -cli flag only)
- gtp/engine.py - GTP interface for Go GUIs (no command-line arguments)
- katago/run_katago.py - KataGo analysis runner (hardcoded SGF file)
Configuration
All configuration is managed through configs/config.yaml, which controls:
- Board size and game rules
- Neural network architecture (RES or CNN)
- MCTS parameters
- Training hyperparameters
- Distributed training settings
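A minimal sketch of reading this file with PyYAML is shown below; the key names are hypothetical placeholders, so consult configs/config.yaml for the real ones.

```python
# Loading the project configuration (key names below are hypothetical).
import yaml

with open("configs/config.yaml") as f:
    config = yaml.safe_load(f)

board_size = config.get("board_size")        # hypothetical key
num_mcts_sims = config.get("num_mcts_sims")  # hypothetical key
```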
Data Flow
- Self-Play: Workers generate games using MCTS + current model
- Data Collection: Games are collected and stored as training examples
- Neural Network Training: Model is trained on collected data
- Model Evaluation: New model is compared against previous best
- Model Update: If improved, the new model becomes the current best (the full loop is sketched below)
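The function and method names in the sketch below are placeholders, not the actual API of training/overseer.py, and the 0.55 promotion threshold is illustrative.

```python
# One training iteration, expressed as placeholder pseudocode in Python.
def training_iteration(best_model, workers, replay_buffer, arena):
    # 1. Self-play: workers generate games with MCTS + the current best model
    examples = [ex for w in workers for ex in w.self_play(best_model)]
    # 2. Data collection: store (state, policy, value) training examples
    replay_buffer.extend(examples)
    # 3. Neural network training on the collected data
    candidate = best_model.clone()
    candidate.train_on(replay_buffer)
    # 4. Model evaluation: pit the candidate against the previous best
    win_rate = arena(candidate, best_model)
    # 5. Model update: promote the candidate only if it improves
    return candidate if win_rate > 0.55 else best_model
```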
Key Files Reference
File | Purpose
---|---
mcts.py | MCTS algorithm implementation
neural_network/go_alphanet.py | Neural network architecture
go/go_logic.py | Go game rules
training/overseer.py | Main training loop management
Development Guidelines
When modifying the codebase:
- Game logic changes should be made in go/
- Neural network modifications go in neural_network/
- Training algorithm changes belong in training/
- Always update configs/config.yaml for new parameters