Pre-compiled inference rules for metadata extraction.
Processor configuration (chunk sizes, directories, maps).
Provider for generating text embeddings.
OptionalissuesOptional issues manager for tracking processing errors.
Pino logger instance.
OptionaltemplateOptional Handlebars template engine for content templates.
OptionalvaluesOptional values manager for tracking rule-extracted values.
Vector store for persistence.
Core document processing pipeline.
Handles extracting text, computing embeddings, and syncing with the vector store.