User Guide

Thema provides a topological data analysis pipeline that transforms raw tabular data into representative graph models through preprocessing, dimensionality reduction, and Mapper graph construction.

How to Use This Guide

This guide is organized for both learning and reference:

Guides by Task

Getting Started

  • Installation - Install Thema via pip or set up development environment

  • Quickstart - Run the full pipeline with a minimal YAML configuration

  • Getting Started - Complete walkthrough from setup to results with uv

Pipeline Components

Workflows

Reference

  • Best Practices - Recommended workflows, parameter choices, and troubleshooting

  • Testing - Test suite information

  • Overview - High-level architecture and terminology

Quick Navigation

Task

Guide

Install Thema

Installation

Run first pipeline

Quickstart

Understand parameters

Data Preprocessing, Embeddings, Graphs & Selection

Build without YAML

Manual Configuration Guide

Tune hyperparameters

Tuning and Selection

Filter and select models

Tuning and Selection

Write custom filters

Customizing Thema

Optimize performance

Best Practices

Troubleshoot issues

Best Practices