John
There are many many rules in Sanskrit like this and the combination of
characters (vowels and consonants) depends on grammar rules. This is
called SANDHI or joining. We can not change those rules for machine
interpretability, the machines have to be intelligent enough to decipher
combos.
Thanks. (01)
Ravi (02)
(Dr. Ravi Sharma) Senior Enterprise Architect (03)
Vangent, Inc. Technical Excellence Center (TEC) (04)
8618 Westwood Center Drive, Suite 310, Vienna VA 22182
(o) 703-827-0638, (c) 3132041740 www.vangent.com (05)
-----Original Message-----
From: ontolog-forum-bounces@xxxxxxxxxxxxxxxx
[mailto:ontolog-forum-bounces@xxxxxxxxxxxxxxxx] On Behalf Of John F.
Sowa
Sent: Tuesday, January 15, 2008 10:22 AM
To: [ontolog-forum]
Subject: Re: [ontolog-forum] (OT) German (06)
Pat's comment emphasizes the point that a significant part of the
difference between English and German is in spelling conventions: (07)
PH> ThewayItalkEnglishoftensoundslikethat, or sopeopleoftentellme. (08)
An example I like to use is (09)
Lebensversicherungsgesellschaftsangestellter (010)
If you remove the copies of the letter S between major constituents,
you get (011)
Leben versicherung gesellschaft angestellter (012)
and a word-for-word translation gives you (013)
life insurance company employee (014)
But, as Pat pointed out, most people pronounce this as (015)
lifeinsurancecompanyemployee (016)
Therefore, German is more helpful for the listener because it
inserts an extra sound to mark the word boundaries. (017)
To help the reader as well as the listener, they should
consider adding a blank as well: (018)
Lebens versicherungs gesellschafts angestellter (019)
or perhaps a hyphen: (020)
Lebens-versicherungs-gesellschafts-angestellter (021)
or maybe extend their practice of capitalizing all nouns
to include internal nouns: (022)
LebensVersicherungsGesellschaftsAngestellter (023)
This convention would also help people who are writing
morphological analyzers for German. One machine translation
program analyzed the following word (024)
Toiletteneingang (025)
which could be written ToilettenEingang for 'toilet entrance'.
But the program interpreted it as (026)
Toilette-nein-gang (027)
which it translated as 'toilet denial procedure'. (028)
Anyone who has participated in lots of meetings might consider
that a rather useful term. (029)
John (030)
_________________________________________________________________
Message Archives: http://ontolog.cim3.net/forum/ontolog-forum/
Subscribe/Config:
http://ontolog.cim3.net/mailman/listinfo/ontolog-forum/
Unsubscribe: mailto:ontolog-forum-leave@xxxxxxxxxxxxxxxx
Shared Files: http://ontolog.cim3.net/file/
Community Wiki: http://ontolog.cim3.net/wiki/
To Post: mailto:ontolog-forum@xxxxxxxxxxxxxxxx (031)
_________________________________________________________________
Message Archives: http://ontolog.cim3.net/forum/ontolog-forum/
Subscribe/Config: http://ontolog.cim3.net/mailman/listinfo/ontolog-forum/
Unsubscribe: mailto:ontolog-forum-leave@xxxxxxxxxxxxxxxx
Shared Files: http://ontolog.cim3.net/file/
Community Wiki: http://ontolog.cim3.net/wiki/
To Post: mailto:ontolog-forum@xxxxxxxxxxxxxxxx (032)
|