BLACK mobile logo

international

Language documentation needs community rights, consent, and recognition: Interview with Van Gujjari writer Taukeer Alam

April 17, 2026

The OpenSpeaks Archives, launched in 2024, is a digital platform helping Indigenous language speakers cite oral knowledge on Wikipedia by documenting, transcribing, and archiving nearly 20 languages from India, Nepal, and Sri Lanka. Taukeer Alam, a conservationist and Van Gujjari speaker from India's nomadic Van Gujjar community, explains that audio and video formats better capture Indigenous languages than written text because they preserve tone, emotion, and pronunciation that books cannot convey. He emphasizes that documentation must be participatory, involving youth who can continue the work, and materials should be quickly shared back to communities in accessible formats before knowledge holders pass away.

Who is affected

  • The Van Gujjar community, a nomadic Muslim Indigenous group in Uttarakhand, India
  • Speakers of Van Gujjari and approximately 20 other languages from India, Nepal, and Sri Lanka
  • Wikimedians seeking to cite Indigenous oral knowledge
  • Community elders (60-80 years old) who hold traditional knowledge
  • First-generation college learners and youth from Indigenous communities
  • Children in these communities who are beginning to achieve literacy
  • Community-based language archivists

What action is being taken

  • The OpenSpeaks Archives is hosting nearly 20 languages from India, Nepal, and Sri Lanka
  • Community-based language archivists are documenting, transcribing, and archiving their languages
  • Taukeer Alam is recording and transcribing folk songs and conducting interviews with elders
  • Youth are being trained in documentation methodology and processes

Why it matters

  • This work addresses the critical gap between oral knowledge systems and written citation requirements for platforms like Wikipedia, enabling Indigenous knowledge to be recognized and preserved. Audio and video documentation captures linguistic elements—tone, emotion, pronunciation, and cultural context—that written text cannot convey, which is essential for languages with unique sound patterns. The urgency is particularly significant because knowledge holders are aging and passing away, taking irreplaceable cultural and linguistic information with them, including the meanings behind traditional songs and practices. The project also raises important questions about data sovereignty, community rights, and protection against exploitation through AI and commercialization of Indigenous knowledge.

What's next

  • No explicit next steps stated in the article

Read full article from source: Global Voices

Language documentation needs community rights, consent, and recognition: Interview with Van Gujjari writer Taukeer Alam