Launched this week

Clusy

AI notebook platform for modern data science

164 followers

AI Coding Agents • Cloud Computing Platforms • AI Code Editors

Clusy is an agent-native notebook platform for researchers and data teams to build, branch, run, and evaluate ML and data science workflows in the cloud. Describe a goal in natural language, and Clusy plans the workflow, sources datasets, preprocesses data, runs parallel experiments in replicated kernels, compares model architectures, and helps produce optimal models through a human-in-the-loop notebook experience.

Free Options

Launch tags: Developer Tools • Artificial Intelligence • Data Science

Clusy

Maker

📌

Clusy is finally live! Sign up for free today, try the platform, and let us know what breaks.

We built Clusy for ourselves with a lot of care, and are now launching to share it with the world. If you use Jupyter, Google Colab, or any other Python notebook in your work, we believe Clusy can improve your productivity with almost no friction or migration cost.

We started building because the way people work in notebooks has not caught up with what AI now makes possible. Jupyter and Colab have been amazing, but we think the next generation of notebooks should feel much more goal-driven, collaborative, and agent-native. So Clusy is built around an end-to-end approach to data science: our agent works alongside you across the whole pipeline, from your idea and environment setup to model training and deployment.

Sign up for a free plan or use CLUSYLAUNCH to get 50% off for the first 3 months!

2d ago

@eldar_hasanov Interesting to see this land right as I'm dealing with the tail end of a similar problem — most "data science platform" tools assume you're staying inside the notebook the whole time, but a lot of real work ends with someone needing a client-facing PDF or report out of it, which usually means exporting to a totally different tool. Curious how Clusy handles that last step, or if it's staying notebook-native by design.

1d ago

Clusy

Maker

@deepanshu_garg9 thank you for the feedback! Clusy can produce files in essentially any format, including figures and reports. We are adding a new skill that will let it render PDFs better and make this feature better suited to that workflow.

23h ago

The parallel-experiments-in-replicated-kernels part is exactly where I'd want to stress-test this.

When I do this by hand I lose the thread fast — three branches, each with slightly different preprocessing, and a week later I genuinely can't tell which dataset version + seed produced the model I ended up keeping.

So when Clusy branches and runs experiments in parallel, does each branch pin its own environment and dataset snapshot, so a result stays reproducible weeks later?

And when I pick a winner, can I merge that branch back into the main notebook cleanly — or does it live on as a separate artifact?

21h ago

Clusy

Maker

@rudratosh great questions - each branch is its own replicated kernel state, so yes, it does have its own environment setup technically. And in terms of merging, you can do both: merge to main or export as its own snapshot!

17h ago

How much of Clusy’s branching and dataset versioning is automated versus requiring manual setup?

1d ago

Clusy

Maker

@thys_beesman almost completely automated! The best part is: you decide. You can jump into the process anytime or let the agent handle parts you don't like.

1d ago

How does it handle large datasets that don't fit in memory, and what integrations does it currently support for pulling data from sources like S3 or BigQuery?

22h ago

Clusy

Maker

@adnan664848 The current free plan sandbox has 20GB of memory allowance. Datasets that don't fit in memory are processed in batches. We use our own S3 at the moment, but we will soon release features that allow users to connect their own data sources (databases, etc.). We already support Snowflake and Databricks.
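As a rough illustration of what "processed in batches" can mean in general, here is a minimal sketch using only the Python standard library. It is not Clusy's implementation; the helper names `read_batches` and `streaming_mean`, the `value` column, and the tiny in-memory CSV are all hypothetical stand-ins for a file too large to load at once.

```python
import csv
import io
from itertools import islice
from typing import Iterable, Iterator


def read_batches(rows: Iterable[dict], batch_size: int) -> Iterator[list]:
    """Yield lists of at most batch_size rows without materializing all rows."""
    iterator = iter(rows)
    while True:
        batch = list(islice(iterator, batch_size))
        if not batch:
            return
        yield batch


def streaming_mean(rows: Iterable[dict], column: str, batch_size: int = 1000) -> float:
    """Compute the mean of one column while holding only one batch in memory."""
    total = 0.0
    count = 0
    for batch in read_batches(rows, batch_size):
        total += sum(float(row[column]) for row in batch)
        count += len(batch)
    if count == 0:
        raise ValueError("no rows to aggregate")
    return total / count


# Hypothetical in-memory CSV standing in for a dataset that does not fit in memory.
sample = io.StringIO("value\n1\n2\n3\n4\n5\n")
mean_value = streaming_mean(csv.DictReader(sample), "value", batch_size=2)
```

Only running totals survive between batches, so peak memory depends on `batch_size` rather than on the size of the dataset.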

17h ago

How does the human-in-the-loop part actually work when the agent is branching off into parallel experiments? Do you step in between runs or only after the comparison view comes back?

20h ago

Clusy

Maker

@araszengin54995 you can do either!

17h ago

How does the "agent-native" planning actually handle steps where I need to bring in proprietary data sources or private models that aren't publicly available?

20h ago

Clusy

Maker

@abdurrahmaaz1o in the available tiers, we allow BYOK (bring your own key), but for very proprietary use cases, we can also discuss on-premise, self-hosted enterprise options.

17h ago

How does the branching actually work under the hood? Like, can I fork an experiment midway and rerun only the changed cells without burning through compute on the whole pipeline again?

19h ago

Clusy

Maker

@narinengnejqio basically yes!
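For readers wondering how "rerun only the changed cells" can work in principle, here is a minimal sketch of one common approach: hash each cell together with the hash of the cell before it, then rerun only the cells whose hash differs from the cached run. This is an assumption for illustration, not a description of Clusy's internals. The function names and the example cells are hypothetical, and the sketch assumes a linear notebook in which every cell depends on all cells above it.

```python
import hashlib


def cell_keys(cells):
    """Hash each cell's source chained with the key of the previous cell.

    Editing one cell changes its key and every later key,
    while the keys of earlier cells stay the same.
    """
    keys = []
    previous = ""
    for source in cells:
        digest = hashlib.sha256((previous + "\n" + source).encode("utf-8")).hexdigest()
        keys.append(digest)
        previous = digest
    return keys


def cells_to_rerun(old_cells, new_cells):
    """Return indices of cells in new_cells whose cached result is stale."""
    old_keys = cell_keys(old_cells)
    new_keys = cell_keys(new_cells)
    return [
        index
        for index, key in enumerate(new_keys)
        if index >= len(old_keys) or old_keys[index] != key
    ]


# Hypothetical notebook: the fork changes only the model-fitting cell.
main = ["df = load()", "df = clean(df)", "model = fit(df)", "score(model)"]
fork = ["df = load()", "df = clean(df)", "model = fit(df, depth=8)", "score(model)"]
stale = cells_to_rerun(main, fork)
```

Here the loading and cleaning cells keep their cached results, and only the edited cell and the cell after it are marked for rerun. A real system would also need to track data and environment changes, which this sketch ignores.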

17h ago
