Charly Walther

Lionbridge AI - Data for machine learning in 300+ languages 🌎🤓🤖

Lionbridge’s 500,000+ annotators label your text, audio, image, and video data for machine learning. Using our custom-built platform, we can:
🤖 Create chatbot training data
🖼 Label images and video
😍 Build sentiment analysis datasets
👩‍💻 Source annotators from around the world
💬 Improve machine translation

Replies

Charly Walther
Hi everyone, we've been delivering ground truth data for NLP and computer vision applications to many of the largest tech companies in the world, and we're excited to make these services accessible to anyone who needs data creation or annotation. We've recently invested in our entity annotation and text/audio/image categorization platforms. Beyond those, we handle basically any other data sourcing and annotation task you can think of, from fairly simple remote microtasks to extremely complex NLP data tasks, so just ask. We'd love to offer you all some pilot projects to see how we can improve your ML model development! Any feedback is highly appreciated!
Marc Thomas
@charlywalther This is potentially really useful for us because we're the only survey platform to support the Welsh language, and the NLP APIs we currently use don't support it. I can't see a list of 300 languages on your site though, only this one (https://lionbridge.ai/languages/). Is there a complete list somewhere?
Charly Walther
We support Welsh! :) Please feel free to reach out, and our team will work with you on how best to support the data needs of your survey platform! We only list the most common languages, since otherwise the list would get way too long :) We also cover some rather obscure languages and even regional dialects... @iammarcthomas
Marc Thomas
@charlywalther Love that. There's a good community of people who will benefit from this. Will be in touch!
Jordi Mon Companys
Hey this looks great, looking forward to trying it.
Lachlan Kirkwood
Love the data creation feature! Really valuable for datasets with missing features.