Mermaid AI:
Mermaid Chart is a diagramming tool that allows you to create diagrams using text. It is built by the team behind the award-winning open-source project, Mermaid JS.
‘Diagrams’ undersells the power of Mermaid: it has native text representations of everything from quadrant charts to Kanban boards.
When we think about designing complex agentic flows, we run into the same issue time and time again: it’s really hard for LLMs to stay on track over long inferential distances. Imagine a simple representation of a task that you might want to give to an agent:
“Contact everybody on the project and get consensus on point x.”
As the LLM starts sending emails and aggregating information to solve this, the simple representation above evolves into a highly complex one, even though the task itself is simple.
After a few emails back and forth with the project members, the LLM would be sitting on a web of context: requests for information, dependencies between facts, different perspectives between departments, sub-tasks, red herrings, meta-conversations, links, citations, decisions, and documents on remote servers, perhaps with multiple versions of each.
Maintaining sufficient context to actually process these in the correct order is really hard.
This is a depth problem rather than a breadth problem. The issue isn’t how long the LLM can keep processing sequential items; it can do that forever. The real question is how the LLM can move up and down the ladder of abstraction: to understand the current state of its world, to avoid getting lost down dead ends, and to see the bigger picture, how it has been decomposed, and how it can be recomposed.
I think Mermaid might have an opportunity to help here: it’s a concise way of storing complex information as a set of incredibly flexible text-based representations.
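To make that idea a little more concrete, here is a minimal sketch in Python of how an agent might keep its task tree as data and render it as Mermaid flowchart text at a chosen depth. Everything here is an assumption for illustration: the Task class, the to_mermaid function, the status values, and the names Alice and Bob are hypothetical and not part of Mermaid or Mermaid Chart. Only the output format (a flowchart TD block with nodes and arrows) is Mermaid's own syntax.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class Task:
    """One node in a hypothetical task tree kept by the agent."""
    id: str                 # Mermaid node id (avoid reserved words such as "end")
    label: str              # human-readable description of the task
    status: str = "todo"    # illustrative values: "todo", "doing", "done"
    children: list[Task] = field(default_factory=list)


def to_mermaid(root: Task, max_depth: int) -> str:
    """Render the tree as Mermaid flowchart text, hiding nodes below max_depth."""
    lines = ["flowchart TD"]

    def visit(task: Task, depth: int) -> None:
        # Each node carries its label and status in quoted text.
        lines.append(f'    {task.id}["{task.label} ({task.status})"]')
        if depth == max_depth:
            return  # deeper detail is omitted at this level of abstraction
        for child in task.children:
            lines.append(f"    {task.id} --> {child.id}")
            visit(child, depth + 1)

    visit(root, 0)
    return "\n".join(lines)


# Hypothetical state partway through the "get consensus on point x" task.
root = Task("goal", "Get consensus on point x", "doing", [
    Task("alice", "Email Alice", "done"),
    Task("bob", "Email Bob", "doing", [
        Task("bob_doc", "Find the document Bob asked for"),
    ]),
])

# Shallow view: the goal and its direct sub-tasks only.
print(to_mermaid(root, max_depth=1))
```

Calling to_mermaid(root, max_depth=2) instead would include the nested sub-task under Bob. The point of the depth parameter is that the same stored state can be shown to the LLM at a higher or lower rung of the abstraction ladder, depending on whether it needs the big picture or the detail.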