All activity
Al
left a comment
Interesting!
The biggest challenge I have is compute cost (my models take hours/days to train). But spot instances are a hit and miss for me - I hate having to rerun my job all over again.. Can your "pre-emptive/spot instances with Atlas's built-in retry mechanism" help work around that?
Foundations Atlas
ML dev tool that saves you up to 8x in cloud GPU costs