DOI: 10.3390/languages11070141 ISSN: 2226-471X

Documenting Environmental Knowledge in the Bahnar Language of Vietnam

K. David Harrison, Hoài Trần, Công Minh Khang Hoàng, Nghĩa Đ. Nguyễn, Hải Lâm Cao, Xơm A, Lisa Lim, Myles L. Lynch, Thuy Bui

Environmental knowledge encoded in Bahnar, an Indigenous language of Vietnam, is vital to the Bahnar community and contributes to broader understandings of biodiversity, climate resilience, and sustainable lifeways. We describe a collaborative approach to documenting Bahnar that integrates computational methods with ethnographic, lexicographic, and linguistic fieldwork. Because Bahnar knowledge is transmitted almost entirely through oral tradition rather than writing, effective documentation cannot rely solely on extractive corpus-based or NLP tools. Although three legacy bilingual Bahnar dictionaries exist, they are partially obsolete, uneven in coverage, and largely inaccessible to the community itself. Our corpus analysis of the Bahnar environmental vocabulary, complemented by intensive community-based fieldwork, reveals semantic patterns that closely link environmental knowledge with Bahnar lifeways, subsistence practices, and material culture. These patterns, we argue, are language-specific and may not emerge from analyses of environmental lexicons in languages such as English or Vietnamese. Bahnar semantic categories attribute aesthetic, medicinal, mythological, and spiritual agency to animals, plants, and landscapes, contrasting with classificatory frameworks common in post-industrial societies that emphasize biophysical, scientific, or economic properties. We propose that community-centered digital lexicography can strengthen Bahnar language vitality, enhance local access to cultural knowledge, and simultaneously advance comparative linguistic and environmental research.

More from our Archive