cloudyn.png
Wednesday August 17th, 2022 2:47AM

Google gets more multilingual, but will it get the nuance?

By The Associated Press
Related Articles
  Contact Editor

LIMA, Peru (AP) — About 10 million people speak Quechua, but trying to automatically translate emails and text messages into the most widely spoken Indigenous language family in the Americas was long all but impossible.

That changed on Wednesday, when Google added Quechua and a variety of other languages to its digital translation service.

The internet giant says new artificial intelligence technology is enabling it to vastly expand Google Translate’s repertoire of the world’s languages. It added 24 of them this week, including Quechua and other Indigenous South American languages such as Guarani and Aymara. It is also adding a number of widely spoken African and South Asian languages that have been missing from popular tech products.

“We looked at languages with very large, underserved populations," Google research scientist Isaac Caswell told reporters.

The news from the California company’s annual I/O technology showcase may be celebrated in many corners of the world. But it will also likely draw criticism from those frustrated by previous tech products that failed to understand the nuances of their language or culture.

Quechua was the lingua franca of the Inca Empire, which stretched from what is now southern Colombia to central Chile. Its status began to decline following the Spanish conquest of Peru more than 400 years ago.

Adding it to the languages recognized by Google is a big victory for Quechua language activists like Luis Illaccanqui, a Peruvian who created the website Qichwa 2.0, which includes dictionaries and resources for learning the language.

“It will help put Quechua and Spanish on the same status,” said Illaccanqui, who was not involved in Google's project.

Illaccanqui, whose last name in Quechua means “you are the lightning bolt,” said the translator will also help keep the language alive with a new generation of young people and teenagers, “who speak Quechua and Spanish at the same time and are fascinated by social networks.”

Caswell called the news a “very big technological step forward" because until recently, it was not possible to add languages if researchers couldn't find a big enough trove of online text — such as digital books, newspapers or social media posts — for their AI systems to learn from.

U.S. tech giants don’t have a great track record of making their language technology work well outside the wealthiest markets, a problem that’s also made it harder for them to detect dangerous misinformation on their platforms. Until this week, Google Translate was offered in European languages like Frisian, Maltese, Icelandic and Corsican — each with fewer than 1 million speakers — but not East African languages like Oromo and Tigrinya, which have millions of speakers.

The new languages will roll out this week. They won't yet be understood by Google's voice assistant, which limits them to text-to-text translations for now. Google said it is working on adding speech recognition and other capabilities, such as being able to translate a sign by pointing a camera at it.

That will be important for largely spoken languages like Quechua, especially in the health field, because many Peruvian doctors and nurses who only speak Spanish work in rural areas and “are unable to understand patients who speak mostly Quechua,” Illaccanqui said.

“The next frontier, or challenge, is to work on speech,” said Arturo Oncevay, a Peruvian machine translation researcher at the University of Edinburgh who co-founded a research coalition to improve Indigenous language technology across the Americas. “The native languages of the Americas are traditionally oral."

In its announcement, Google cautioned that the quality of translations in the newly added languages “still lags far behind" other languages it supports, such as English, Spanish and German, and noted that the models “will make mistakes and exhibit their own biases." But the company only added languages if its AI systems met a certain threshold of proficiency, Caswell said.

“If there’s a significant number of cases where it’s very wrong, then we would not include it," he said. “Even if 90% of the translations are perfect, but 10% are nonsense, that’s a little bit too much for us."

Google said its products now support 133 languages. The latest 24 are the largest single batch to be added since Google incorporated 16 new languages in 2010. What made the expansion possible is what Google is calling a “zero-shot” or “zero-resource” machine translation model — one that learns to translate into another language without ever seeing an example of it.

Facebook and Instagram parent company Meta introduced a similar concept called the Universal Speech Translator last year.

Google's model works by training a “single gigantic neural AI model” on about 100 data-rich languages, and then applying what it's learned to hundreds of other languages it doesn't know, Caswell said. “Imagine if you’re some big polyglot and then you just start reading novels in another language, you can start to piece together what it could mean based on your knowledge of language in general,” he said.

He said the new group ranges from smaller languages like Mizo, spoken in northeastern India by about 800,000 people, to more widely spoken languages like Lingala, spoken by around 45 million people across Central Africa.

It was more than 15 years ago — in 2006 — that Microsoft got some positive attention in South America with a software feature translating familiar Microsoft menus and commands into Quechua. But that was before the current wave of AI advancements in real-time translation.

Harvard University language scholar Américo Mendoza-Mori, who speaks Quechua, said getting Google's attention brings some needed visibility to the language in places like Peru, where Quechua speakers are still lacking in many public services. The survival of many of these languages “will depend on their use in digital contexts,” he said.

Another language scholar, Roberto Zariquiey, said he's skeptical that Google could make an effective language revitalization tool for Quechua, Aymara or Guarani without closer participation from community groups in the region.

“Languages are deeply linked to lives, to cultures, to ethnic groups and political organizations,” said Zariquiey, a linguist at the Pontifical Catholic University of Peru. “This should be taken into account.”

—-

The new languages added are: Assamese, Aymara, Bambara, Bhojpuri, Dhivehi, Dogri, Ewe, Guarani, Ilocano, Konkani, Krio, Lingala, Luganda, Maithili, Meiteilon (Manipuri), Mizo, Oromo, Quechua, Sanskrit, Sepedi, Sorani Kurdish, Tigrinya, Tsonga and Twi.

—-

O'Brien reported from Providence, Rhode Island.

  • Associated Categories: Associated Press (AP), AP Business, AP Business - Corporate News, AP Technology News
© Copyright 2022 AccessWDUN.com
All rights reserved. This material may not be published, broadcast, rewritten, or redistributed without permission.
Ukraine to hold first war crimes trial of captured Russian
Ukraine's top prosecutor says the country plans to hold its first war crimes trial of a captured Russian soldier
4:04PM ( 7 minutes ago )
Live updates | Ukrainian prez thanks US for $40B in aid
Ukrainian President Volodymyr Zelenskyy has thanked the U.S. for a new $40 billion aid package
3:58PM ( 13 minutes ago )
Gleyber day: Torres' 5 RBIs lead surging Yanks over Jays 5-3
Gleyber Torres drove in all New York’s runs with a go-ahead, three-run homer off José Berríos in the fourth inning and a two-run single, leading the surging Yankees over the Toronto Blue Jays 5-3  for their 15th win in 17 games
3:46PM ( 25 minutes ago )
Associated Press (AP)
Biden sees bigger role for US farms due to Ukraine war
President Joe Biden is vowing to help American farmers try to ease a global spike in food prices following Russia’s invasion of Ukraine
3:17PM ( 54 minutes ago )
Democrats seek criminal charges against Trump Interior head
Democrats on the House Natural Resources Committee have asked the Justice Department to investigate whether a Trump administration interior secretary engaged in possible criminal conduct while helping an Arizona developer get a crucial permit for a housing project
3:14PM ( 57 minutes ago )
Film producers defend safety in Alec Baldwin shooting
A film production company is contesting sanctions by New Mexico officials for alleged workplace safety violations on the set of “Rust,” where actor and producer Alec Baldwin fatally shot a cinematographer in October
3:11PM ( 1 hour ago )
AP Business
Live updates | Separatists celebrate in Donetsk
Separatists in Donetsk on Wednesday celebrated the eighth anniversary of self-proclaimed independence from Ukraine
12:46PM ( 3 hours ago )
Google strikes content deals with 300 European publishers
Google says it struck licensing deals with 300 news publishers in Europe in its latest effort to comply with a recently introduced European Union copyright law
12:17PM ( 3 hours ago )
Live updates | 2 Romanian journalists detained in Moldova
Romania’s foreign ministry says that two Romanian journalists were detained Tuesday in the Russia-backed breakaway region of Transnistria in neighboring Moldova
11:53AM ( 4 hours ago )
AP Business - Corporate News
Running Twitter may be much harder than Elon Musk thinks
On Tuesday, Elon Musk said he would reverse Twitter’s ban of former President Donald Trump, who was booted in January 2021 for inciting violence at the U.S. Capitol, should he succeed in acquiring the social platform for $44 billion
1:09PM ( 3 hours ago )
EU commission proposes plan to fight child pornography
The European Union’s executive arm has proposed requiring online platforms to detect and report the sharing of child sex abuse images on the internet
8:44AM ( 7 hours ago )
Musk says he would reverse Twitter's ban of Donald Trump
Tesla CEO Elon Musk said he will reverse Twitter’s ban of former President Donald Trump if his deal to buy the social media company goes through
5:49PM ( 22 hours ago )
AP Technology News
Ukraine to hold first war crimes trial of captured Russian
Ukraine's top prosecutor says the country plans to hold its first war crimes trial of a captured Russian soldier
4:04PM ( 7 minutes ago )
Live updates | Ukrainian prez thanks US for $40B in aid
Ukrainian President Volodymyr Zelenskyy has thanked the U.S. for a new $40 billion aid package
3:58PM ( 13 minutes ago )
Gleyber day: Torres' 5 RBIs lead surging Yanks over Jays 5-3
Gleyber Torres drove in all New York’s runs with a go-ahead, three-run homer off José Berríos in the fourth inning and a two-run single, leading the surging Yankees over the Toronto Blue Jays 5-3  for their 15th win in 17 games
3:46PM ( 25 minutes ago )
Stocks stumble lower after inflation remains high in April
Stocks fell on Wall Street Wednesday after a report on inflation came in worse than feared
3:41PM ( 31 minutes ago )
Braves beat Riley, Cards defeat O'Neill in arbitration
The Atlanta Braves beat third baseman Austin Riley and the St. Louis Cardinals defeated outfielder Tyler O’Neill in the first salary arbitration decisions this year
3:37PM ( 34 minutes ago )