-- Whose pot did the Croats, Bosnians and Slovenes piss in to not make it into this super Slavic union?
-- China Mieville wrote a book[0] along very similar thought lines which won the Locus Award.
Also, Garkavenko appears not to have taken the obvious side [1] in Ukraine's present conflict given how he is described in Foer's article
[0] https://en.wikipedia.org/wiki/Embassytown
[1] http://maidantranslations.com/2014/06/24/russian-volunteers-...
I wonder if a similar approach could be taken with language construction. Instead of spending 25+ years fleshing out the details of a language in painstaking detail, computer programs could be devised that, using large amounts input, determine the most "efficient" means of expressing information. The approach would not only be far less labor intensive, it could also accommodate the rapidly evolving nature of language, for example adding to its "dictionary" in response to new phenomena in need of naming.
You could perhaps use a typological database with grammatical features of the world's languages and somehow select an "optimal" combination from it, but that's a far cry from letting a computer determine the most efficient means of expressing information; we have no idea how to define information/meaning, so that it's still an impossible dream. I don't think the problem is that designing languages is hard per se, it's that people can't be bothered to agree on one and learn it.
Seriously, though, take a look at the link I posted in https://news.ycombinator.com/item?id=8180924
One of the techniques used is to computationally create a space of possible ways to partition semantic domains on a plane whose dimensions are simplicity and informativeness, in order to look at where in the possible space it is that real languages lie. While it's not been done (to my knowledge) for a whole language, it's potential direction to go.
I worked in his lab on one of many projects showing that most human languages use a near optimal trade-off in various semantic domains (so far - color, kinship, containers, and spatial relations). His work also includes some of the best evidence for some language dependent forces in cognition interacting with some universal ones.
I don't know much about designing human languages, but I know how hard it is to design a decent programming language (see http://colinm.org/language_checklist.html), and building a serious human language seems orders of magnitude more difficult. I've never seen an attempt that really intrigued me until I found Ithkuil.
Interesting. How is one supposed to talk about math without a concept of zero?
This has also happened with the language Lojban[1], which was 'forked' from Loglan[2] when the creator starting making copyright complaints so the community could maintain control.
Such an odd concept that someone could 'own' a language, but I guess if you created it I can see why you would want to.
[1] http://en.wikipedia.org/wiki/Lojban [2] http://en.wikipedia.org/wiki/Loglan
I seem to remember Umberto Eco mentioning that he does not offer his own interpretations of his novels for a similar reason, but I can't find the quote.
I did get a few somewhat weird emails that I think were in Russian some years ago, but I think they figured out pretty quick that it wasn't the right email address to reach Quijada.
If this invented language were to catch on, it likely wouldn't be a generation or two and kids who grew up speaking it would start saying the Ithkuil equivalent of things like "yo dog, that's the rad shizaz!". Then, several generations thereafter grandmothers would be regularly using the word "shizaz" and they would have to put it in the dictionary. That's just the way it goes and is probably the reason we don't all speak the same language in the first place.
That being said, I've always been fascinated by the idea of a systematically created universal language and think the world would be much better place with one....if that were possible.
This was a neat article.
IMO this suggests the bottleneck is something about our brains on a biological rather than linguistic level.
This does seem to suggest that biology may be the limiting role in controlling the rate at which humans convey information. Indeed, the language mentioned in the article seems almost laughably cryptic and dense. However, I feel that the limitation of the mentioned study results from the fact that it treats information on a relatively limiting per syllable basis. Quijada seems to suggest that an artificially constructed language has the ability to incorporate all the implicit meanings of a phrase that are left unsaid in normal conversation.
Ultimately, while Quijada's project seems quite unlikely to catch on among those who are not fringe pseudoscientists, it poses interesting philosophical questions about the nature of speech and communication and perhaps earns its title as a "conceptual-art project."
[1] http://rosettaproject.org/blog/02012/mar/1/language-speed-vs...
I think you mean the inverse.
I'm curious about the desire to reduce ambiguity, which seemed to be emphasized as a motivation for the creation of Ithkuil and some of the other languages mentioned.
Is it desirable to completely eliminate ambiguity? I can see why it would be desirable in a scientific paper or a public political debate. But in everyday interactions, (intentional) ambiguity plays many important roles.
In my experience, politeness is bolstered by some level of ambiguity. Rather than explicitly state your needs, desires or opinions, you imply them at some level of abstraction, allowing other participants in the conversation to accept or decline more easily. Imagine Jessica who has brought two friends who don't know each other to see a play. They chit-chat a little afterwards, then Jessica goes home early leaving two virtual strangers to have a drink together. It's not hard to imagine the conversation going like this:
A: "Did you enjoy the play?"
B: "It was very interesting. I thought the stage dressing was a little unconventional."
A: "Yes, I noticed that too. Very creative. I was intrigued by the style of the narration. It really let the audience write the story for themselves."
B: "It certainly didn't constrain the imagination did it? I couldn't help noticing that many of the actors took a somewhat avant-garde interpretation of the source material."
A: "Yes, as if they didn't want it to seem like they were 'acting', so to speak?"
B: It was awful wasn't it!?
A: Thank god! Yes, worst thing I've ever seen!
Ambiguity allows subtle social cues (not so subtle in my example!) that avoid direct confrontation when it might be uncomfortable. If one person loved the play and the other hated it, they each might want to avoid offending the other.
Intentional ambiguity plays an important role in other social interactions like dating or friendship-making. Correct use of ambiguity protects feelings, demonstrates subtlety and good judgement, and avoids non-productive conflict.
In artistic expression too, ambiguity is often intentional or even necessary to the effectiveness of the work. Consider a poem like "My Papa's Waltz" [1]. Does it describe happy memories of the narrator's father, or dark memories of childhood abuse [2]? Can it describe both? Is there something in between? The ambiguity isn't a byproduct of imprecise language. The ambiguity is the meaning. To resolve it is to remove the point of the work. The poem cannot be effectively communicated in any medium that does not allow for the existence of ambiguity.
[1] http://www.poetryfoundation.org/poem/172103
[2] 'Yet, this poem has an intriguing ambiguity that elicits startlingly different interpretations. Kennedy calls it a scene of "comedy" and "persistent love", and Balakian, in part, labels it a "comic romp" (62). In contrast, Ciardi sees it as a "poem of terror"' - from http://www.mrbauld.com/exrthkwtz.html
This is amazing. But I can't grasp the difference between inference and conjecture - they are both 'figuring out' what happened rather than knowing or hearing?
I'm in awe of the creator of Any language. Because to create a (Good) language isn't easy. This is true or both programming languages and otherwise. However, it comes without saying that adoption is a vital component of any language, and with mass adoption comes evolution.
People will often make changes in languages, make their own dialects (based on things perhaps the can relate to on a deeper level, etc..). This isn't a bad thing. To me it only signifies growth and expansion of the language.
+1
[1] http://www.reddit.com/r/tokipona [2] http://www.reddit.com/r/lojban
Loglan is a predecessor and inspiration to Lojban.
Slovio is the Slavic version of Esperanto.
Dothraki, Elvish (Quenya, Sindarin), Klingon, Na'vi are constructed languages from popular novels/movies.
Someone doesn't understand the point of Esperanto.
> A sentence like “On the contrary, I think it may turn out that this rugged mountain range trails off at some point” becomes simply “Tram-mļöi hhâsmařpţuktôx.”
Wikipedia:
> Romanization: Oumpeá äx’ääļuktëx.
> Translation: "On the contrary, I think it may turn out that this rugged mountain range trails off at some point."
I mean, things like:
> A sentence like “On the contrary, I think it may turn out that this rugged mountain range trails off at some point” becomes simply “Tram-mļöi hhâsmařpţuktôx.”
Simply?
We could have used LZW algorithm and the sentence could probably become even shorter, just a "simple" sequence of random-ish bytes. If you increase the number of allowed symbols, of course you need less symbols to convey the same information. If you allow for a limitless set of words that are dynamically generated from combining many roots, of course the number of words decreases... sometimes down to 1, as in polysynthetic languages. This is Information Theory 101.
Compressed with zlib: 86 bytes.
Compressed with lzma: 98 bytes.
The Ithkuil representation is just 30 UTF-8 encoded bytes.
Compressed with zlib: 39 bytes.
Compressed with lzma: 47 bytes.
(Measured using python's zlib/pylmza modules to avoid e.g. file header overhead)
It's hard to achieve this kind of compression without an external dictionary. What Quijada has created with Ithkuil is, in part, a dictionary for the space of human thought and concepts, something I wouldn't have expected to work in the way the article describes it.
I'm having trouble figuring out what's going on with lzma because the spec is lying about the header, so I won't attempt to guess the correct number there.
Just enjoy it.
Moreover, people communicating are imperfect. So if you have a language which is very precise and concise, you would have to spend a lot of effort to find a word or set of words which exactly expresses your meaning (in programming, we call it design when we do it upfront, and debugging when we do it post factum) and communication would be a very complex exercise. However, if you have a lot of words which mean roughly the same, you can be sure the meaning is passed through even if the words are not chosen super-carefully.
"Hegelochus, the actor in Euripides' Orestes, which was presented in 408 BC, in line 279 of the play, instead of "after the storm I see again a calm sea" (galeén' horoo), Hegelochus recited "after the storm I see again a weasel" (galeên horoo)."
This is the whole reason we are able to make ourselves understood in a noisy, imprecise world. Even if you miss a few syllables - or even half a sentence, you can usually piece together what the other person was trying to say.
Imagine someone technology-illiterate trying to describe a problem they're having with their computer in this language. Impossible.
Language should be useful and expressive for its users. Human languages designed by the kind of people who prize simplicity and regularity above all other qualities tend to fail for much the same reason programming language designers are hurt and disappointed to discover C is still popular.
Which is good for scientists and logicians, but would probably be an issue for novelists and poets. Exploiting ambiguity is a common feature of the arts.
Personally, I don't have much faith in the idea that you can make people's communication more precise and concise by designing the language in a certain way, just as I don't think that politically correct language leads to meaningful change. This is simply because I don't believe than language can drastically change the way people think (linguistic determinism, strong Whorfianism). Esperanto is very similar to natural languages in its (im)precision, which might just be the level where humans naturally converge to if left to their own devices.
And apq’uxasiu for 'gawk' doesn't strike me as a particularly huge win for conciseness.
It seems like it's missing the linguistic equivalent of huffman coding: make frequently used things shorter.
Also, it's hardly concise with respect to time if you have to spend 30 minutes consulting a dictionary to utter a single sentence.
http://www.sciencedirect.com/science/article/pii/S0010027711... (Sorry for the paywall...)
So maybe Ithkuil isn't what a language should be after all. And maybe that's why no natural language looks like it...
In the first case you have privileged one group, but in the second everyone loses ... In the second case you create an additional barrier because there is a large amount of new vocabulary that everyone has to learn, while in the other anyone who doesn't know European languages will have to learn some of its vocabulary, but that might be useful to them anyway, so it clearly seems like the better option to me.
I speak some Esperanto but I'm no zealot. The "explanation" that your neighbours don't speak Esperanto, if such a simplified thing can exist, is probably as simple as network effects. I can learn German and speak to the man next-door, or I can learn Esperanto and speak to some theoretical people that may exist somewhere but I don't know them and they are mostly a bunch of nerds that meet at the local co-op to speak in Esperanto mostly about how great Esperanto is.
Go ahead. Ask your neighbour why he doesn't speak Esperanto. Does he say " The 'basic' number‐terms tri, trio, tria ('three, threesome, third') are a crowded jumble, making a mockery of the regular root/noun/adjective pattern they imitate" (K5 in the article)?
Or does he say "what's that?"?
* the language has an excessive number of phonemes
me: On this point I disagree strongly. A language with more phonemes can form shorter words and convey information faster. that's just basic math (43^9 >> 16^9)
* the vocabulary is too large to be practical and has dubious links to other languages. specifically, Basic English requires far fewer (>10x fewer!) words for competence and is more recognizeable due to worldwide borrowing.
me: Here I agree. The amount of memorization required is quite large and borrowing words doesn't make up for it. Interlingua took a better path, but it targeted the only continent that doesn't need it.
* synthesis mechanisms are irregular and insufficiently general. in particular, esperanto's semantic structure and word synthesis fail to allow the speaker to compensate for missing vocabulary by using compound words or overly general words with attached descriptors
[I don't understand this sort of thing, but it sounds serious]
* the alphabet is needlessly complex. it uses symbols people do not recognize and cannot type in favor of ASCII digraphs with wide recognition.
me: Also, it does not accord significant importance to the way that an orange bikeshed would clearly clash with the trees in the background.
Okay, fine, I agree. This is just putting the nail in the coffin at this point.
* the noun-to-verb-to-adjective declension is not compatible with the structure of meaning, and so cannot be extended in a consistent, predictable way. that is to say, the variety of possible verbs is such that it is basically undecidable what noun should correspond to a given verb in the general case, and vice versa.
me: I don't know this but as far as I can tell he's right. For example, what is the "verb" associated with "electron"?
* the language is inherently sexist.
me: This is not a trivial criticism.
So there are some substantial criticisms in the article, methinks. And you can't discount the impact of difficulty on network effects: if one person has too much trouble learning Esperanto, they're not going to pass it on to their friends, who will not pass it on to their friends.
I wonder if a language designed to be usable with as small a core as possible -- something the author suggests -- could have a substantially better chance of catching on? Perhaps if it were also good at borrowing words from other languages ("extensible"), and grew in the right community for a little while...
Whenever a post about constructed languages comes up, you post this link, which I find disappointing: Rye's rant is, well, just a rant and his reasons for not learning Esperanto are just bad.
There was a time when I spent a few months learning Esperanto. I eventually gave up because 1) whilst it is relatively easy to get going with Esperanto, speaking it comfortably would require about as much work as any other language and 2) the Esperanto community is made up out of folks that are either much older than me or a little strange.
I don't think my first criticism is a fault of Esperanto. Human languages require a lot of convention and shared cultural ideas in order for communication to be compact and, at the same time, clear. No language, auxiliary or otherwise, can magically remove these requirements.
The second criticism is perhaps a little unfair but still important. Cultural cachet is important for a language and a language that appears to cater only to certain groups is going to have a tough time.
But if someone is keen to learn a new language, especially if that person is a monoglot, they should be encouraged to give it a try, even if their language of choice is Esperanto. They can only improve their linguistic skills.
""" During recent travels to Spain, I had the opportunity to observe participants in a pan-European seminar on youth and globalisation. While English was the default language of this group, in conversations between any two people the participants would often switch to the native language of one or the other. For example, a young man from France would greet another in English, but upon discovering that his conversation partner is from Italy, would switch to Italian. This would not find approval among Esperantists. Ironically, English proves the neutral choice here. It is often seen as a sure bet for international communication among young people in many countries, but it is well understood that other languages may serve just as well. In the Esperanto movement, on the other hand, there is an ideological attachment to Esperanto which mandates its use even if there are other, more culturally rich possibilities. """
My experience was at large international demo-parties. I mainly noticed this for English and German, but that's because I speak those two languages relatively fluently (Dutch is my mother tongue). The same must have been happening around me for French or Spanish (or who knows what else) as well, but I don't speak those languages well enough to tell for sure whether they were using (say) Spanish as a common bridging language or were native speakers.
And yes, English is usually the sure bet. Although some of the comments elsewhere ITT have piqued my interest to perhaps learn some Spanish in the future, especially if it's really that easy to learn. Funny, I used to hate learning language (French and German) in high school. It's only later in my life that I found out I actually have quite a knack for it :) (I should probably blame the way it was taught, but I can't really get that much worked up about it, I'm pretty satisfied with the quality of my education overall)
As an Esperanto speaker, I have no objection to anyone using any language on any occasion, and I'm happ. I have just returned from an Esperanto conference in Dinan, Brittany. I was able to use some basic Breton with a few individuals, but the sky did not fall on my head.
I wish you well with your language learning. You may wish to add Esperanto one day.
Still, I'd say Esperanto is an interesting case because it is undeniably the most successful constructed language, with the longest continuous (still continuing) history. It is interesting to learn it for that reason alone, and as far as learning second languages go, it is also one of the easiest to learn. That could make it a useful first step towards learning other second languages.
Why hasn't Esperanto caught on with more speakers? Maybe it has something to do with the reasons in his rant; personally I find it much more likely that it's due to network effects. Even a "perfectly" constructed language would be very unlikely to win over the vested interests and inertia of people.
"There's already an existing language that fulfills so many of those criteria that it's going to be very hard to organize a new one from scratch. And the existing language already has a deep legacy of literature and culture.
That language is the world's second most widespread: Spanish.
Spanish has been destroying the dreams of Esperantists and others over the years who hope to build a more regular, orderly, and easy to learn common language based on common Indo-European roots.
Turns out that it's dang hard to design anything easier or more accessible to speakers of any European language than Spanish already is. The spelling and pronunciation are already completely regular and predictable. The grammar is straightforward and common to almost all European tongues. The vocabulary is mostly based on Latin with some Arabic variety thrown in, but it's been standardized over the centuries so that a lot of it has a simpler and more natural morphology. The sounds are a simple subset of what most languages already use.
It's a great second language: it's fairly easy, the world's second most widespread tongue, and spoken in warm countries with very friendly natives. It's not likely to provide you with many lucrative business opportunities, though. None of the world's financial capitals use it."
(lavar (present lavo lavas lava lavamos lavan) (past lavé lavaste lavó lavámos lavarón) (imperfect lavaba lavabas lavabamos lavaban) (future lavaré lavarás lavará lavaramos lavarán) (conditional lavaría lavarías lavaríamos lavarían) (present-subjunctive lave laves lavemos laven) (gerund lavando) (participle lavado))
And that's ignoring the twenty or so irregular verbs and roughly ten "irregular" patterns (e.g. querer). It's a lot simpler than French or Italian, to be sure, or Latin, for that matter, but it could be a lot easier.
Steven Pinker explores this point in an entertaining presentation[1] for RSA. He also covers other language topics such as spacetime encoding, and profanity but the last part analyzes the need for ambiguity in a language. I deep-linked into the relevant portion of the presentation although the the entire talk is very enlightening. It's worth rewinding to the beginning to watch the entire talk. The 2nd youtube video[2] is the mostly the same material but it's the older one he presented at Google TechTalks.
This is a bit speculative, but I have the feeling there is a certain kind of personality that is typically fascinated with this idea of being able to rule out ambiguity. It seems to me a tendency to overextend the idea of mathematical rigor to other areas of life.
The Search for the Perfect Language - Umberto Eco - 2010
There are other factors at play, such that speech perception is multi-modal. E.g. see:
Mark 10:25 (and parallel versions in Matthew and Luke) has prompted much speculation over the centuries with regards to the origin of it's evocative metaphor: "It is easier for a camel to go through the eye of a needle than for someone who is rich to enter the kingdom of God."
But when you consider that the word for camel (kamêlos) and for rope (kamilos) differ by only one vowel, quite a mundane explanation springs to mind: someone in the early church misheard, misspelled, or mistranslated Jesus' original admonition.
The more satisfying explanation, the one that I prefer, is that this is a pun that happens to have gotten lost in translation.
The few comments in this blog article offer some interesting explanations:
http://rambambashi.wordpress.com/2010/06/03/common-errors-36...
If you were designing a language from scratch you also wouldn't choose features like the unnecessary grammatical gender (even though it's relatively consistent and easy to get right) and the b/v distinction in the orthography that's unpronounced in most dialects.
On paper it's still a far, far better auxiliary language than English though (I wouldn't be surprised to live to see a day when most people worldwide speak a regularised English with much more basic grammar and sensible orthography one day though; it's easier to build momentum taking the second language people are most exposed to as the starting point)
I wish it would be as easy to prototype human languages as it is for software...
Learning any foreign language will tend to do that, though. It's a function of being forced to consider your native tongue as a language, rather than just speaking it. Plus if you learn a foreign language, you will have the advantage of being able to talk to an established base of people who speak it.
I think Esperanto is a fun hobby and I have fun toying with it every few years, even if I'm under no delusion that it's a practical way to, say, do international business. I do think that if it or any universal second language (French, English, Mandarin, whatever) were to suddenly become more widespread, the world would be better off.
My claim was only that the reason that it's not more widespread has nothing to do with subjective flaws in the language itself.
And probably the same response after you patiently explained them what CP/M (or Esperanto) was: "Why the hell should I care about that? Give me something that works with Windows (or Mandarin) so I can work with a sizeable portion of the rest of the world !!!"...
Shorter words, yes. Convey information faster - only if you have perfect speakers and perfect listeners. Otherwise, in the best case you'd just have people ask - did you just say "hélló" or "hëllò"? In the worse case, they'd just misunderstand what you're saying. And since you strive to eliminate redundancy, instead of nonsense which would trigger request for repetition, you'd get meaning - but meaning something else that you didn't intend to say.
Perhaps I didn't include enough context from the document. He cited Esperanto as containing 34 phonemes, compared to English at 43, and then a list including Andean Spanish with 17, Japanese with 14, Hawai'ian with 8, or Rotokas with 6. That is, he cited (what seem to be) outliers with the smallest number of phonemes and implied that these were desirable. Later he notes that Eastern Polish has 49 phonemes.
I agree that a language can certainly have phonemes which are difficult to distinguish (Mandarin Chinese is famous for this) or hard to pronouce (the voiced "th" in English "the" is famous for this) or inscrutable for some listeners (most English speakers cannot distinguish dz from d), but I thought his minimalism was too extreme.