Diego Marani, a writer and interpreter, recently described a moment that sticks in your memory. Serving as a simultaneous interpreter at an altar during an ecumenical council, he came to the conclusion that translation was not his calling. The position, the clerical tone, and the raised arms in the particular gesture of liturgical address were all part of the job of becoming a priest. He pointed out that what he accomplished that day was beyond the capabilities of any machine. It’s a very specific, almost personal example. However, it highlights a point that is often overlooked in the current discussion about AI translation: language is more than just information transferring between containers.
These days, AI is truly adept at transferring that data. A decade ago, the state of real-time voice-to-voice translation would have seemed unthinkable. It is based on neural networks that capture phonetic characteristics, regional vocabulary, and emotional register. When a person in a Chicago meeting room speaks English and their Tokyo counterpart hears Japanese, the gap closes almost instantly. The friction that once made communicating in a foreign language difficult is disappearing. The language barrier is no longer an obstacle in the majority of practical interactions, such as business negotiations, travel arrangements, and customer service. It would be strange to act otherwise; that is a real accomplishment.
But pay attention to what’s going on beneath it. When major AI systems are tested for multilingualism, researchers at Brown University, the University of Oregon, and Hong Kong University of Science and Technology consistently find that these models translate into English much more accurately than they translate out of it. When working in Korean, Vietnamese, Indonesian, or Tamil, they are more likely to produce false information and perform significantly worse when responding to factual questions in non-English languages. One well-known chatbot accurately responded in English when asked in Spanish where the next White Lotus season would take place, but it only provided “somewhere in Asia” when asked in the translated version of the same question. The technology is not as smooth. It is approximate in one direction and fluent in the other, which is toward English.
Beyond the technical, this asymmetry is significant. It’s possible that AI translation is subtly strengthening English’s hegemony in international trade and communication rather than undermining it. More English text on the internet, more English in scholarly publications, and more English in the data sets that teach these models how language functions are examples of training data that reflects current power disparities. “If you don’t publish papers in English, you’re not relevant,” stated Pascale Fung, director of the Center for AI Research at the Hong Kong University of Science and Technology and a seven-language speaker. The AI tools being developed to overcome linguistic barriers are partially the result of the same structural bias that they are meant to address.
AI Language Translation & the Synthesized Polyglot — Key Facts & Context
| Core Technology | Real-time voice-to-voice neural translation, synthetic voice cloning, multilingual large language models |
| Current Capability | Advanced AI systems can translate spoken language in real-time across dozens of languages, capturing phonetic qualities and emotional intent |
| Notable Systems | Translation frameworks including Hermes the Polyglot; multilingual LLM models (GPT-4, PaLM 2, and others) |
| Speed | Near-conversational real-time translation; comparable to human interpreter speeds in many contexts |
| English Bias Problem | AI chatbots perform substantially better in English than in other languages; models are less accurate, more prone to fabrication, and culturally flatter in non-English outputs |
| Training Data Imbalance | Most large language model training data is in English and Chinese; minority and regional languages severely underrepresented |
| One-Way Fluency Issue | Studies show AI translates INTO English far better than FROM English — raising equity concerns for non-English-speaking users |
| Digital Divide Risk | Stanford research (2025) and WIRED reporting (2023) document how AI language gaps exclude non-English-speaking communities from economic and informational opportunity |
| Cultural Nuance Gap | Machine translation cannot reliably capture regional humor, metaphor, cultural history, or code-switching — areas where human bilingualism retains clear advantage |
| Impact on Language Education | The Atlantic (2024) published “The End of Foreign-Language Education” — arguing AI is reducing the survival/travel utility of learning a second language |
| Researcher Concern | University of Oregon, Brown University, and Hong Kong University studies document chatbot bias toward English; increased risk of homogenizing global culture |
| Counterargument | Some academics argue AI deepens language engagement by enabling personalized practice without judgment, shifting motivation from utility to cultural curiosity |
| Endangered Languages | AI tools are being explored to document and preserve endangered languages; also risks of leaving low-resource language communities behind |
| OpenAI Acknowledgment | GPT-4 technical report acknowledged majority training data is English with “a US-centric point of view” |

There is also the cultural dimension, which is more difficult to quantify but may have greater long-term implications. Regional humor, metaphor, the particular meanings of words that have no clear equivalent, and the silence that can mean one thing in a Japanese business conversation and another in a Brazilian family dinner are just a few examples of the things that language carries that word-for-word or even sentence-level translation cannot consistently replicate. AI systems trained on English-dominated data are likely to completely overlook the fact that the Basque word “uso,” which means dove, can serve as an insult, giving it the serene symbolism that the English-speaking internet associates with the bird. These are not examples of edge cases. They represent the actual use of language.
There is a sense that the discussion is being held at the incorrect level of abstraction as this technology advances. Will AI eliminate the need to learn a foreign language? This is the topic of discussion in scholarly articles and editorial columns. is true, but it takes a backseat to a more unsettling one: what happens to regional and minority languages around the world when the main impact of AI translation is to make English more accessible while further lagging behind Swahili, Quechua, and Tagalog? According to Stanford research from 2025, the communities are completely shut out of AI tools because their languages don’t produce enough training data for the models to care about, not because translation is flawed.
The reversibility of this trajectory is still unknown. For Latin American, African, and Southeast Asian dialects, some researchers are creating new training data sets. With its PaLM 2 model, Google has made credible attempts to train on over 100 languages. These are real steps. However, the amount of English text continues to increase, and large language models’ statistical architecture will inevitably favor the language with the greatest representation. There is a synthetic polyglot. The question of whether it speaks for everyone or primarily for those who have already been heard is still up for debate.
London Bilingualism's content on health, medicine, and weight loss is solely meant for general educational and informational purposes. This website does not offer any diagnosis, treatment recommendations, or medical advice.
We consistently compile and disseminate the most recent information, findings, and advancements from the medical, health, and weight loss sectors. When content contains opinions, commentary, or viewpoints from professionals, industry leaders, or other people, it is published exactly as it is and reflects those people's opinions rather than London Bilingualism's editorial stance.
We strongly advise all readers to consult a qualified medical professional before acting on any medical, health, dietary, or pharmaceutical information found on this website. Since every person's health situation is different, only a qualified healthcare provider who is familiar with your medical history can offer you advice that is suitable for you.
In a similar vein, any legal, regulatory, or compliance-related information found on this platform is provided solely for informational purposes and should not be used without first obtaining independent legal counsel from a licensed attorney.
You understand and agree that London Bilingualism, its editors, contributors, and affiliated parties are not responsible for any decisions made using the information on this website.
