Graphbook is a framework for building efficient, interactive DAG-structured data pipelines composed of your own custom-built nodes, and it all works best with PyTorch, Hugging Face, and more...
Extend the behavior of Graphbook's nodes and implement custom functions in Python in your favorite IDE.
Batching and multiprocessing IO are built-in making your development easy and your pipelines speedy.
Pipelines are interactive: pause, play, and view ML model outputs in realtime.
Write processing nodes using Python in your favorite code editor and keep your code version controlled
@step("DinoV2")
@batch(8, "images")
@param("model", "transformers/automodel", default="facebook/dinov2-base")
@param("processor", "transformers/autoimageprocessor", default="facebook/dinov2-base")
def process_images(ctx, images, items, notes):
inputs = ctx.processor(images=images, return_tensors='pt')
outputs = ctx.model(**inputs)
last_hidden_states = outputs.last_hidden_state
return last_hidden_states
@step("DinoV2")
@batch(8, "images")
@param(
"model",
"transformers/automodel",
default="facebook/dinov2-base"
)
@param(
"processor",
"transformers/autoimageprocessor",
default="facebook/dinov2-base"
)
def process_images(ctx, images, items, notes):
inputs = ctx.processor(images=images)
outputs = ctx.model(**inputs)
last_hidden_states = outputs.last_hidden_state
return last_hidden_states
Assemble a pipeline in our graph-based editor with your own processing nodes
Run, monitor, and adjust parameters in your workflow
Iterate, operate, and monitor an ML-based data processing pipeline all in one tool. Connect to any data source, use your own PyTorch or Tensorflow models, and maximize your GPU utilization without having to write tedious multiprocessing code.
Graphbook gives users the ability to easily develop solutions to various tasks that require AI/ML inference pipelines. It facilitates development by offering interactivity, visualizations, and multiprocessing IO. Built to address diverse needs, Graphbook significantly reduces pipeline development time.
Do not trust a third party with your data. Graphbook is always free and open source. Deploy your own Graphbook instances on-premise or in the cloud, and start building.
At its core, Graphbook is a framework for building efficient DAG-structured AI/ML data pipelines, but there are many features that help you build.
No. But you can build no-code ML for your customers and internal teams with this framework.
Yes. Your nodes are written in Python and workflows are serialized as .json files. You are recommended to track everything with Git.
Not yet, but we plan on adding this feature soon. For now, you must assemble pipelines in the UI.
In Graphbook, you can continue to use your workflow as-is or set new variables (directly in the workflow) such as where your production database is.
Yes, Graphbook is abstract enough where you can implement anything that can be written in Python including sending API requests to OpenAI.
The framework has a custom implementation of multiprocessing workers that run in the background for both loading and dumping to keep your GPU at max utilization.
We are actively looking for collaborators. You are very welcome to contribute! Visit our repo.