भाषा शाला

bhāṣā śālā

language house

An etymologically informed reader for Indian language texts. It provides word-level transliteration, translation, and etymological connections across hundreds of Indic languages.

Word-by-Word Analysis

Every word gets transliteration, translation, grammar, and morphology. Hover any word to see its full breakdown with provenance from multiple dictionaries.
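As a rough illustration of the transliteration step, here is a minimal Python sketch that converts Devanagari to IAST. It is not the project's implementation: the table names and the `to_iast` function are hypothetical, the tables cover only common letters, and the inherent vowel is always kept (Sanskrit-style), so Hindi schwa deletion is not modeled.

```python
# Minimal Devanagari -> IAST sketch (illustrative; covers only the letters listed here).
CONSONANTS = {
    "क": "k", "ख": "kh", "ग": "g", "घ": "gh", "ङ": "ṅ",
    "च": "c", "छ": "ch", "ज": "j", "झ": "jh", "ञ": "ñ",
    "ट": "ṭ", "ठ": "ṭh", "ड": "ḍ", "ढ": "ḍh", "ण": "ṇ",
    "त": "t", "थ": "th", "द": "d", "ध": "dh", "न": "n",
    "प": "p", "फ": "ph", "ब": "b", "भ": "bh", "म": "m",
    "य": "y", "र": "r", "ल": "l", "व": "v",
    "श": "ś", "ष": "ṣ", "स": "s", "ह": "h",
}
VOWELS = {"अ": "a", "आ": "ā", "इ": "i", "ई": "ī", "उ": "u", "ऊ": "ū",
          "ऋ": "ṛ", "ए": "e", "ऐ": "ai", "ओ": "o", "औ": "au"}
MATRAS = {"ा": "ā", "ि": "i", "ी": "ī", "ु": "u", "ू": "ū",
          "ृ": "ṛ", "े": "e", "ै": "ai", "ो": "o", "ौ": "au"}
SIGNS = {"ं": "ṃ", "ः": "ḥ"}
VIRAMA = "्"


def to_iast(text):
    """Transliterate Devanagari to IAST, keeping every inherent 'a'."""
    out = []
    pending_a = False  # True while the last consonant still awaits its inherent 'a'
    for ch in text:
        if ch in CONSONANTS:
            if pending_a:
                out.append("a")
            out.append(CONSONANTS[ch])
            pending_a = True
        elif ch in MATRAS:
            # A dependent vowel sign replaces the inherent 'a'.
            out.append(MATRAS[ch])
            pending_a = False
        elif ch == VIRAMA:
            # Virama suppresses the inherent 'a'.
            pending_a = False
        else:
            if pending_a:
                out.append("a")
                pending_a = False
            # Independent vowels and signs are mapped; anything else passes through.
            out.append(VOWELS.get(ch) or SIGNS.get(ch) or ch)
    if pending_a:
        out.append("a")
    return "".join(out)


print(to_iast("भाषा शाला"))
```

Applied to the site's own name, this reproduces the transliteration shown at the top of the page.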

Origin Tracing

See where each word comes from: trace its journey from Proto-Indo-European through Sanskrit and Prakrit into modern Indic languages with interactive trees and maps.
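One way to picture origin tracing is as a walk up a chain of parent links. The sketch below assumes a hypothetical `Form` record and a small hand-written `FORMS` table (Hindi hāth, Prakrit hattha, Sanskrit hasta); neither reflects the site's actual data model, and a real chain could continue further back.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Form:
    language: str
    form: str
    parent: Optional[str] = None  # key of the ancestor form, if known


# Hand-written sample data for illustration only.
FORMS = {
    "hi:hāth": Form("Hindi", "hāth", parent="pra:hattha"),
    "pra:hattha": Form("Prakrit", "hattha", parent="sa:hasta"),
    "sa:hasta": Form("Sanskrit", "hasta"),
}


def trace_origin(key, forms):
    """Follow parent links from a form back to its oldest known ancestor."""
    chain = []
    seen = set()  # guards against accidental cycles in the data
    while key is not None and key in forms and key not in seen:
        seen.add(key)
        node = forms[key]
        chain.append((node.language, node.form))
        key = node.parent
    return chain


for language, form in trace_origin("hi:hāth", FORMS):
    print(f"{language}: {form}")
```

The `seen` set is a defensive choice: merged etymological data can contain loops, and the walk should stop instead of spinning.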

1.9M Unified Lexicon

Cross-referenced dictionary compiled from Wiktionary (17 languages), Jambu/CDIAL (290K forms), Monier-Williams (286K entries), and Heritage Engine (928K Sanskrit forms).
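To make the idea of a cross-referenced lexicon concrete, here is a small Python sketch that merges entries from several sources under one headword while recording where each gloss came from. The `build_lexicon` function, the record layout, and the sample glosses are assumptions for illustration, not the project's schema or quotations from the dictionaries.

```python
from collections import defaultdict


def build_lexicon(sources):
    """Merge {source_name: [(headword, gloss), ...]} into one headword index.

    Each gloss keeps the name of the source it came from, so provenance
    is never lost when entries are combined.
    """
    lexicon = defaultdict(list)
    for source, entries in sources.items():
        for headword, gloss in entries:
            lexicon[headword].append({"gloss": gloss, "source": source})
    return dict(lexicon)


# Placeholder entries; the glosses are paraphrased, not copied from the sources.
SAMPLE_SOURCES = {
    "Wiktionary": [("हस्त", "hand")],
    "Monier-Williams": [("हस्त", "the hand"), ("शाला", "house, hall")],
}

lexicon = build_lexicon(SAMPLE_SOURCES)
for entry in lexicon["हस्त"]:
    print(entry["source"], "->", entry["gloss"])
```

Keeping one record per source, instead of collapsing glosses into a single string, is what lets a reader see which dictionary supports which meaning.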

AI-Powered Disambiguation

When a word has multiple meanings, an LLM (Llama 3.3 70B) reads the surrounding context to select the right sense, then caches the result so the same lookup is never sent to the API twice.
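The caching behaviour can be sketched as a lookup table keyed by a hash of the word, its context, and the candidate senses. Everything named below (`SenseCache`, `stub_llm`) is hypothetical: the stub stands in for the real model call, and the in-memory dict stands in for whatever persistent store the site uses.

```python
import hashlib
import json


class SenseCache:
    """Cache sense choices so an identical request reaches the model only once."""

    def __init__(self, choose_sense):
        # choose_sense(word, context, senses) -> index into senses
        self._choose = choose_sense
        self._store = {}
        self.calls = 0  # number of times the model was actually invoked

    def _key(self, word, context, senses):
        payload = json.dumps([word, context, senses], ensure_ascii=False)
        return hashlib.sha256(payload.encode("utf-8")).hexdigest()

    def disambiguate(self, word, context, senses):
        key = self._key(word, context, senses)
        if key not in self._store:
            self.calls += 1
            self._store[key] = self._choose(word, context, senses)
        return senses[self._store[key]]


def stub_llm(word, context, senses):
    """Stand-in for the LLM: picks sense 1 when the context has a future-tense ending."""
    return 1 if "गा" in context else 0


cache = SenseCache(stub_llm)
# Hindi कल (kal) can mean "yesterday" or "tomorrow" depending on context.
first = cache.disambiguate("कल", "मैं कल जाऊँगा", ["yesterday", "tomorrow"])
second = cache.disambiguate("कल", "मैं कल जाऊँगा", ["yesterday", "tomorrow"])
print(first, second, cache.calls)
```

Hashing the full (word, context, senses) triple means a changed context or a changed sense list produces a new key, so a stale answer is not reused.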

Cognate Map

See where a word's relatives live: cognate forms across 615 languages are plotted on an interactive map, powered by Jambu etymological data.

Fully Verifiable

Every meaning and etymology shows its source: a dictionary badge, a confidence score, and direct links. AI-generated glosses are clearly marked and cross-checked.

Built on 1.9M+ Entries of Open Linguistic Data

How Bhāṣā Śālā Uses These Sources

1. OCR

PDFs are processed with Gemini 2.5 Flash to extract text and word bounding boxes.

2. Dictionary Lookup

Each word is looked up across Wiktionary (16 languages), Jambu/CDIAL (290K forms), and Monier-Williams (160K entries).

3. Disambiguation

An LLM (Llama 3.3 70B via Groq) selects the best sense for each word in context.

4. Etymology Trees

Cross-linguistic cognate data spanning hundreds of Indic languages is assembled into interactive etymology visualizations.