Standard Hindi derives much of its formal and technical vocabulary from Sanskrit. Standard or shuddh (“pure”) Hindi is used only in public addresses and radio or TV news, while the everyday spoken language in most areas is one of several varieties of Hindustani, whose vocabulary contains words drawn from Persian and Arabic. In addition, spoken Hindi includes words from English and other languages as well.
Vernacular Urdu and Hindi share the same grammar and core vocabulary and so are practically indistinguishable. However, the literary registers differ substantially in borrowed vocabulary; in highly formal situations, the languages are barely intelligible to speakers of the other. Hindi has looked to Sanskrit for borrowings from at least the 19th century, and Urdu has looked to Persian and Arabic for borrowings from the eighteenth century. On another dimension, Hindi has been associated with the Hindu community and Urdu with the Muslim community.
There are five principal categories of words in Standard Hindi:
Similarly, Urdu treats its own vocabulary, borrowed directly from Persian and Arabic, as a separate category for morphological purposes.
Hindi from which most of the Persian, Arabic and English words have been ousted and replaced by tatsam words is called Shuddha Hindi (pure Hindi). Chiefly, the proponents of the so-called Hindutva (“Hindu-ness”) are vociferous supporters of Shuddha Hindi.
Excessive use of tatsam words sometimes creates problems for most native speakers. Strictly speaking, the tatsam words are words of Sanskrit and not of Hindi—thus they have complicated consonantal clusters which are not linguistically valid in Hindi. The educated middle class population of India can pronounce these words with ease, but people of rural backgrounds have much difficulty in pronouncing them. Similarly, vocabulary borrowed from Persian and Arabic also brings in its own consonantal clusters and “foreign” sounds, which may again cause difficulty in speaking them.