chunky.typesΒΆ

Core data structures for the semantic chunking pipeline.

Classes

Chunk(chunk_id, text, source_document, ...)

A chunk of text ready for downstream indexing.

ChunkerConfig(max_chars, min_chunk_chars, ...)

Configuration shared across chunkers.

Document(path, content, language, metadata, ...)

Normalized representation of a file to be chunked.