A2M LogoA2M

LLM-Codex-Reference-Vault

MIT License

A high-fidelity semantic Knowledge Graph constructed from raw software documentation, cheat sheets, and technical books.

Overview

valid

This project bridges the gap between static Markdown/PDF technical files and LLM interactions by parsing documents into connected Chunk, CodeBlock, and Concept nodes within Neo4j, completely eliminating hallucinations via the Model Context Protocol (MCP). Features GPU-Accelerated PDF Processing: Offloads heavy PDF-to-Markdown conversion to a remote Docling instance. Image Stripping & Sanitization: Retroactively cleans extraneous Base64 images from Markdown files to preserve LLM token limits. Semantic Neo4j Graph: Constructs a connected graph mapping abstract technical concepts down to their raw code implementations. Automated Dev.to Integrations: Ships with ingest_devto.py for headless API ingestion of highly-rated articles via cron jobs. Native MCP Integration: Plug-and-play compatibility with Neo4j's official Model Context Protocol server.

Similar projects

Based on overlapping categories, tags, and capabilities.

workspace-mcp

1 views

Comprehensive, highly performant Google Workspace Streamable HTTP & SSE MCP Server for Calendar, Gmail, Docs, Sheets, Slides & Drive

agentaiapiautomationclaude

@nghoihin (Jack H. Ng)

0 views

Beever Atlas MCP server with Claude Code

AIgithubmcp

Zara Zhang (@zarazhangrui)

0 views

Frontend Slides skill (19k stars on GitHub) upgraded with a new design brain — pulls from Beautiful HTML Templates library to generate visually styled slides

AIgithub
Trust score100/100

About

This project bridges the gap between static Markdown/PDF technical files and LLM interactions by parsing documents into connected Chunk, CodeBlock, and Concept nodes within Neo4j, completely eliminating hallucinations via the Model Context Protocol (MCP).

Features GPU-Accelerated PDF Processing: Offloads heavy PDF-to-Markdown conversion to a remote Docling instance. Image Stripping & Sanitization: Retroactively cleans extraneous Base64 images from Markdown files to preserve LLM token limits. Semantic Neo4j Graph: Constructs a connected graph mapping abstract technical concepts down to their raw code implementations. Automated Dev.to Integrations: Ships with ingest_devto.py for headless API ingestion of highly-rated articles via cron jobs. Native MCP Integration: Plug-and-play compatibility with Neo4j's official Model Context Protocol server.

Author & Community

AuthorStephen Phillips(stephen.phillips.work@gmail.com)

Are you the author of this project?

Claim this pre-seeded listing to manage details, edit tags, or upload assets.

Please sign in using the button in the header to claim repository ownership.

Submitted via Web Dashboard.