
TTS Language Engineer

Job ID: 1623305 | Evi Technologies Limited


Do you want to work on one of the coolest and most innovative pieces of technology in recent years? Come and join us and use your skills to enable our worldwide customers to interact with technology in the richest and most accessible way possible - through speech! We're the Text to Speech (TTS) team at Amazon. We build best-in-class TTS solutions and create magical experiences on Amazon's growing portfolio of speech products.

A day in the life
Our Language Engineers lead the building of TTS voices from beginning to end, and every day is a step closer to delivering the voice to our customers. You start your morning by checking the Slack messages and emails that arrived overnight from other time zones: customer feedback, company-wide announcements, system notifications and updates from team members based overseas. You respond to these and notice that your customer has delivered new materials with which to further enhance your voice. After attending your stand-up, you might join a meeting with a software engineer about that feature you requested, or with the research team about a new technology they would like you to work with. You continue your evaluation, send out your results, respond to emails and Slack messages from your colleagues around the world, and decide how to train a new DNN speech model for an exciting, upcoming voice experience. Another day moving your projects forward, but it is always Day 1 at Amazon.

About the hiring group
We're the Amazon Text to Speech (TTS) team - bringing effortless, ubiquitous speech capabilities at scale for Amazon customers. We create natural and magical experiences for a growing portfolio of speech products, in the Cloud and for Devices. We are a diverse organisation of multidisciplinary computational linguists, scientists, software engineers and audio UX designers, organised into multifunctional, agile teams specialised in building state-of-the-art speech synthesis products. We start with our customers' needs, design the solution, develop and apply the technology, deliver the voice to production, and support our customers as they use these increasingly lifelike voices across an expanding range of languages and applications.

Job responsibilities
We are looking for a passionate, talented, and technology-savvy language engineer to help build an industry-leading portfolio of TTS voices grounded in the most recent advances in speech and language technology. Our mission is to push the envelope in Text-to-Speech (TTS) in order to provide the best possible experience for our customers. Your role is to leverage your strong background in linguistics and language technologies to help build the next generation of our synthetic voices, used by millions of customers every day. In this role, you will be a key member of the team delivering voices to our customers.

As a TTS Language Engineer you will work alongside audio engineers, researchers, and software development engineers who are subject matter experts in speech synthesis and related fields. You will actively collaborate with program managers to deliver new voice products to Amazon customers on an international scale. You will work on a variety of tasks, from natural language processing and machine learning to linguistics as it relates to TTS, to create new voices and experiences. You will bootstrap and test new functionalities, analyze system performance and identify areas for improvement. You will collaborate with other language engineers and data associates on data collection and quality evaluation.

The ideal candidate will be familiar with speech and text processing techniques and have solid knowledge of linguistics or phonetics. We are particularly interested in candidates with experience in training and evaluating machine learning and deep learning models. The successful candidate will link customer requirements to high-quality speech processing components and lead the development process in a cross-functional team.

Amazon is an equal opportunities employer. We believe passionately that employing a diverse workforce is central to our success. We make recruiting decisions based on your experience and skills. We value your passion to discover, invent, simplify and build. Protecting your privacy and the security of your data is a longstanding top priority for Amazon. Please consult our Privacy Notice to know more about how we collect, use and transfer the personal data of our candidates.


Basic qualifications
· Undergraduate degree in Computational Linguistics, Speech Sciences, Linguistics, or a related field
· Knowledge of machine learning and commonly used ML packages and libraries
· Knowledge of scripting languages (e.g. Python, Perl, Ruby, bash)
· Knowledge of phonetics/phonology, ability to create and fix phonetic transcriptions, and experience with regular expressions
· Working proficiency in English
· Knowledge of Unix/Linux command line tools
· Excellent written and spoken communication skills


Preferred qualifications
· Native or near-native fluency in English
· Working experience in a second language other than English
· MSc in Computational Linguistics (or equivalent field with computational emphasis); alternatively, 2 years’ experience in the field
· Experience with machine learning or building machine/deep learning models
· Hands-on experience working with Natural Language Processing or Speech Processing
· Experience investigating the feasibility of applying scientific principles and concepts to business problems and products
· Strong personal interest in learning, researching, and creating new technologies related to foreign languages, linguistics, phonetics, phonology and language technology
· Practical knowledge of version control systems (e.g. Git) and agile development
· Comfortable and motivated when working in a fast-paced, highly collaborative, dynamic work environment