Go offline with the Player FM app!
#179 New Trends in Machine Translation with Large Language Models by Longyue Wang
Manage episode 374591073 series 2975363
Joining SlatorPod this week is Longyue Wang, a Research Scientist at Tencent AI Lab, where he is involved in the research and practical applications of machine translation (MT) and natural language processing (NLP).
Longyue Longyue expands on Tencent’s approach to language technology where they integrate MT with Tencent Translate (TranSmart). He highlights how Chinese-to-English MT has made significant advancements, thanks to improvements in technology and data size. However, translating Chinese to non-English languages has been more challenging.
Recent research by Longyue explores large language models’ (LLMs) impact on MT, demonstrating their superiority in tasks like document-level translation. He emphasized that GPT-4 outperformed traditional MT engines in translating literary texts like web novels.
Longyue discusses various promising research directions for MT using LLMs, including stylized MT, interactive MT, translation memory-based MT, and a new evaluation paradigm. His research suggests LLMs can enhance personalized MT, adapting translations to users' preferences.
Longyue also sheds light on how Chinese researchers are focusing on building Chinese-centric MT engines, directly translating from Chinese to other languages. There's an effort to reduce reliance on English as a pivot language.
Looking ahead, Longyue's research will address challenges related to LLMs, including handling hallucination and timeless information issues.
Chapters
1. Intro (00:00:00)
2. What is Tencent? (00:01:29)
3. Professional Background and Interest in MT and NLP (00:03:44)
4. Tencent's Interest in Language Technology (00:06:03)
5. Perception of Language Technology in China (00:08:42)
6. MT Quality for Chinese (00:12:01)
7. ChatGPT's Translation Capabilities (00:16:45)
8. Interesting Directions for MT Using LLMs (00:20:06)
9. Translation Memory-Based MT (00:22:51)
10. Interactive MT (00:24:05)
11. Using ChatGPT to Evaluate Translation (00:25:56)
12. Personalized MT and Multi-Modal MT (00:27:57)
13. The Focus of China-Based Research (00:30:35)
14. Future Research Initiatives (00:33:55)
214 episodes
Manage episode 374591073 series 2975363
Joining SlatorPod this week is Longyue Wang, a Research Scientist at Tencent AI Lab, where he is involved in the research and practical applications of machine translation (MT) and natural language processing (NLP).
Longyue Longyue expands on Tencent’s approach to language technology where they integrate MT with Tencent Translate (TranSmart). He highlights how Chinese-to-English MT has made significant advancements, thanks to improvements in technology and data size. However, translating Chinese to non-English languages has been more challenging.
Recent research by Longyue explores large language models’ (LLMs) impact on MT, demonstrating their superiority in tasks like document-level translation. He emphasized that GPT-4 outperformed traditional MT engines in translating literary texts like web novels.
Longyue discusses various promising research directions for MT using LLMs, including stylized MT, interactive MT, translation memory-based MT, and a new evaluation paradigm. His research suggests LLMs can enhance personalized MT, adapting translations to users' preferences.
Longyue also sheds light on how Chinese researchers are focusing on building Chinese-centric MT engines, directly translating from Chinese to other languages. There's an effort to reduce reliance on English as a pivot language.
Looking ahead, Longyue's research will address challenges related to LLMs, including handling hallucination and timeless information issues.
Chapters
1. Intro (00:00:00)
2. What is Tencent? (00:01:29)
3. Professional Background and Interest in MT and NLP (00:03:44)
4. Tencent's Interest in Language Technology (00:06:03)
5. Perception of Language Technology in China (00:08:42)
6. MT Quality for Chinese (00:12:01)
7. ChatGPT's Translation Capabilities (00:16:45)
8. Interesting Directions for MT Using LLMs (00:20:06)
9. Translation Memory-Based MT (00:22:51)
10. Interactive MT (00:24:05)
11. Using ChatGPT to Evaluate Translation (00:25:56)
12. Personalized MT and Multi-Modal MT (00:27:57)
13. The Focus of China-Based Research (00:30:35)
14. Future Research Initiatives (00:33:55)
214 episodes
All episodes
×Welcome to Player FM!
Player FM is scanning the web for high-quality podcasts for you to enjoy right now. It's the best podcast app and works on Android, iPhone, and the web. Signup to sync subscriptions across devices.