All activity
Alleft a comment
Interesting! The biggest challenge I have is compute cost (my models take hours/days to train). But spot instances are a hit and miss for me - I hate having to rerun my job all over again.. Can your "pre-emptive/spot instances with Atlas's built-in retry mechanism" help work around that?
Foundations AtlasML dev tool that saves you up to 8x in cloud GPU costs
