Breaking Barriers, Building Bridges: Increasing Language Representation in Southeast Asia

 

 

Held outside of Singapore for the very first time, the third Languages Summit was co-hosted by AI Singapore, Google, and VISTEC in Bangkok, Thailand, and brought together a passionate community of AI experts and researchers from all corners of the SEA region. Dedicated to building a more inclusive AI future, the event provided a platform for exchanging ideas and insights on efficient model training, obtaining and sharing high-quality data, and regional updates on AI.

Sharing successes and challenges: A regional perspective

  • Google Research showcased innovative model composition techniques like CALM and MatFormer and introduced the newest family of Gemini and Gemma models
  • AISG unveiled a new chapter with SEA-LION v2, Project SEALD, and SEACrowd
  • Regional collaborators shared updates on their progress with regional LLMs and specific applications and use cases across Indonesia, the Philippines, Thailand and Vietnam

Building a Data-Rich Ecosystem with Project Aquarium
Project Aquarium, a community-driven data map and platform built exclusively for SEA-focused data, will be launched soon! This initiative aims to fill gaps in the region, where high-quality data in certain areas is not easily accessible. If you are interested in contributing to Project Aquarium, please reach out to us here.

Roundtable discussions
Summit participants delved into the rapid progression of LLMs, identifying the unique challenges and opportunities that they bring to Southeast Asia. A lively roundtable discussion explored data loopholes, copyright concerns, and the importance of community collaboration in AI development.

 

Summit Attendees
We are grateful to have had an incredible lineup of diverse experts and researchers at the summit!

  • Asian Development Bank: Samuel Ang
  • Ateneo de Manila: Jimson Paulo Layacan, Isaiah Flores, Katrina Bernice Tan
  • Bandung Institute of Technology: Ayu Purwarianti
  • CAIR: Erika Legara and Sebastian Ibanez
  • Chulalongkorn University: Ekapol Chuangsuwanich and Attapol Rutherford
  • Data Science Singapore: Koo Ping Shung
  • GoTo: Ofir Shalev
  • Hanoi University of Science and Technology: Dinh Viet Sang
  • IMDA: Akriti Vij
  • IndoNLP: Alham Fikri Aji
  • KASIKORN Business-Technology Group: Thadpong Pongthawornkamol and Patcharin Areewong
  • KORIKA: Andreas Tjendra
  • Monash University: Derry Wijaya
  • NECTEC: Apivadee Piyatumrong, Thepchai Supnithi
  • SCB 10X Typhoon Team: Potsawee Manakul, Kunat Pipatanakul and Kasima Tharnpipitchai
  • SMU: Lim Ee Peng and Ngo Chong Wah
  • Vidyasirimedhi Institute of Science and Technology: Sarana Nutanong, Wannaphong Phatthiyaphaibun
  • BGDI Thailand: Patipan Prasertsom

A huge thank you to all attendees for your enthusiastic participation at the summit! Your invaluable contributions are driving us toward more representative AI for Southeast Asia.

Let’s write more stories together
Join us on an exhilarating quest to increase language representation in Southeast Asia! Visit the Project SEALD webpage to uncover the details of this groundbreaking initiative and find out how you can play a part. Let’s unite to create AI that truly mirrors the rich diversity and vibrant cultures of our region. If you are interested in participating in the next Language Summit, please contact us here. See you then!
