Docling – Open Source Integrated AI and Semantic Tech

See Flexible GraphRAG Initial Version Blog Post

See New Tabbed UI for Flexible GraphRAG (and Flexible RAG)

Flexible GraphRAG on GitHub

X.com Steve Reiner @stevereiner LinkedIn Steve Reiner LinkedIn

Improved the performance of flexible-graphrag
- Added doing parallel Docling document conversion helped pipeline timing
- Now not doing KeywordExtractor/SummaryExtractor also helped pipeline timing
- Ollama Parallel Processing (need OLLAMA_NUM_PARALLEL=4)
- Async PropertyGraphIndex with use_async=True
- Increased kg_batch_size from 10 to 20 chunk
- Logging added for performance timing
Added performance testing results to readme.md (6 docs with openai with each graph database (neo4j, kuzu, falkordb)
Added docs/performance.md: has performance testing results for each graph database with 2,4,6 docs with openai and 2,4 docs with ollama
Added support for FalkorDB graph database https://www.falkordb.com/ and https://github.com/FalkorDB/falkordb The abstractions of LlamaIndex, LlamaIndex support for FalkorDB, and the configurability of flexible-graphrag made this a relatively straightforward process.
Added LlamaIndex DynamicLLMPathExtractor support (works on openai, not on ollama currently)
Added config of kg extractor type (simple, schema, or dynamic) to set which LlamaIndex extractor to use (SimpleLLMPathExtractor, SchemaLLMPathExtractor, or DynamicLLMPathExtractor)
Added config of MAX_TRIPLETS_PER_CHUNK and MAX_PATHS_PER_CHUNK
Added readme.md info on system environment setup of ollama for performance and parallelism (OLLAMA_CONTEXT_LENGTH, OLLAMA_NUM_PARALLEL, etc.)
Added new default schema with 35+ relationship combinations, more relations, and entity types: PERSON, ORGANIZATION, TECHNOLOGY, PROJECT, LOCATION
Fixed file upload dialog performance in all 3 front ends: React, Angular, and Vue (chosen files display quickly after dialog ok)

Tag: Docling

Flexible-GraphRAG: Performance improvements, FalkorDB graph database support added