Pre-compiled inference rules for metadata extraction.
Processor configuration (chunk sizes, directories, maps).
Optional content hash cache for move detection.
Provider for generating text embeddings.
Optional enrichment store for persisted enrichment metadata.
Optional issues manager for tracking processing errors.
Pino logger instance.
Optional Handlebars template engine for content templates.
Optional values manager for tracking rule-extracted values.
Vector store for persistence.
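As an illustration of how a content hash cache can support move detection, here is a minimal sketch. It is not the library's implementation: `ContentHashCache`, `record`, and `fnv1a` are hypothetical names, and the FNV-1a hash is only a dependency-free stand-in for whatever content hash the real cache uses. The idea is that when identical content shows up under a new path, the cache can report the path where it was last seen, so the caller can treat the file as a candidate move instead of re-embedding it.

```typescript
// Hypothetical sketch: none of these names come from the documented API.

/** 32-bit FNV-1a hash of a string as 8 hex digits (stand-in for a real content hash). */
function fnv1a(text: string): string {
  let hash = 0x811c9dc5; // FNV offset basis
  for (let i = 0; i < text.length; i++) {
    hash ^= text.charCodeAt(i);
    hash = Math.imul(hash, 0x01000193); // multiply by the FNV prime, keeping 32 bits
  }
  return ("0000000" + (hash >>> 0).toString(16)).slice(-8);
}

class ContentHashCache {
  // Maps a content hash to the path where that content was last recorded.
  private readonly pathByHash = new Map<string, string>();

  /**
   * Records `content` at `path`. Returns the previous path when identical
   * content was last recorded under a different path, otherwise undefined.
   */
  record(path: string, content: string): string | undefined {
    const hash = fnv1a(content);
    const previous = this.pathByHash.get(hash);
    this.pathByHash.set(hash, path);
    return previous !== undefined && previous !== path ? previous : undefined;
  }
}
```

A returned path only signals a candidate move. A caller would still need to confirm that the old path no longer exists before treating it as a move and not a copy.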
Core document processing pipeline.
Extracts text, computes embeddings, and syncs the results with the vector store.
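The pipeline stages described above can be sketched roughly as follows. This is an assumption-laden outline, not the actual class: `EmbeddingProvider`, `VectorStore`, `VectorPoint`, `chunkText`, and `processDocument` are hypothetical names, the text is assumed to be extracted already, and chunking is done by plain character count as a placeholder for the configured chunk sizes.

```typescript
// All names below are hypothetical; only the stages (text, embeddings,
// vector store sync) come from the description.

interface EmbeddingProvider {
  embed(texts: string[]): Promise<number[][]>;
}

interface VectorPoint {
  id: string;
  vector: number[];
  payload: { path: string; chunkIndex: number; text: string };
}

interface VectorStore {
  upsert(points: VectorPoint[]): Promise<void>;
}

/** Splits text into consecutive chunks of at most `chunkSize` characters. */
function chunkText(text: string, chunkSize: number): string[] {
  if (chunkSize <= 0) throw new RangeError("chunkSize must be positive");
  const chunks: string[] = [];
  for (let start = 0; start < text.length; start += chunkSize) {
    chunks.push(text.slice(start, start + chunkSize));
  }
  return chunks;
}

/** Chunks the text, embeds each chunk, and upserts one point per chunk. Returns the point count. */
async function processDocument(
  path: string,
  text: string,
  chunkSize: number,
  embedder: EmbeddingProvider,
  store: VectorStore,
): Promise<number> {
  const chunks = chunkText(text, chunkSize);
  if (chunks.length === 0) return 0; // nothing to embed or store

  const vectors = await embedder.embed(chunks);
  const points: VectorPoint[] = chunks.map((chunk, i) => {
    const vector = vectors[i];
    if (vector === undefined) throw new Error("embedding count mismatch");
    return { id: `${path}#${i}`, vector, payload: { path, chunkIndex: i, text: chunk } };
  });

  await store.upsert(points);
  return points.length;
}
```

Passing the embedder and store as interfaces mirrors the dependency list above and lets the pipeline be exercised with in-memory fakes.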