Multi-Layer Dictionary (2016) (learnthesewordsfirst.com)

222 points by david_ar 6 years ago | 109 comments

Animats 6 years ago |

The 60 words:

to see, saw, seen. thing, something, what. this, these. the other, another, else, is the same as, be, am, are, being, was, were. one of. two of. person, people. many of, much of. inside. not, do not, does not, did not, some of. all of. there is, there are. more than, live, alive. big. small. very, kind of. if, then. touch. far from. near to, in a place, someplace, where. above. on a side of, hear, heard. say to, said about. word. true.

raldi 6 years ago | |

How would you define left and right based on these words?

skeoh 6 years ago | | |

From http://learnthesewordsfirst.com/Lesson-12F.html#12-21

12-20. right.

[X is on the right side of your body.] = X is on this side of your body: Most people write using the hand they have on this side of their body.

[I use my right hand when I draw pictures.]

12-21. left.

[X is on the left side of your body.] = X is on this side of your body: Most people do not write using the hand they have on this side of their body. They write using their other hand.

[My child held my left hand.]

freyr 6 years ago | | |

The dictionary starts with those 60 words, like axioms in mathematics. Then it builds up a vocabulary on top of them.

Right is defined this way: [X is on the right side of your body.] = X is on this side of your body: Most people write using the hand they have on this side of their body.

So "on a side of", "people", "be/is", are included in those first 60 words. "body", "write", "hand", etc. are defined after the first 60 words, but before "right" and "left".

mirekrusin 6 years ago | | |

lesson 12 covers left/right (360 words).

naikrovek 6 years ago | | |

Why don't you follow the link and find out?

bloak 6 years ago | |

One of those words is not a word in British English.

(Why do I mention that? Firstly, perhaps it's a fun puzzle for non-native speakers of American English to identify the word. Secondly, it's surprising that a difference between British and US English is apparent in such a short list of such basic words, considering that sometimes it's possible to write whole paragraphs of English without it being apparent which variety of English is being used.)

mrob 6 years ago | | |

That word is informal even in American English. It's less common in British English, but it's growing in popularity in both versions, and I wouldn't call it "not a word" even in British English. But I disagree with its inclusion in a first lesson, because its main use over the more common standard alternative (rot13: "fbzrjurer") is signaling casual speech.

Google Ngram Viewer lets you compare popularity of words in British vs. American English, so it's useful for investigating this.

https://books.google.com/ngrams

spicerguy 6 years ago | | |

British English speaker here - I'm having difficulty identifying the word you're referring to here. Or are you referring more to definition and statistical presence in common usage?

pbhjpbhj 6 years ago | | |

I know the word you mean (no spoiler tag, so I won't say it, just 'it could almost be German' {to use a linguistic stereotype}) but I wouldn't recognise it as _not_ en-gb, just unusual.

I'm en-gb native.

vivekf 6 years ago | | |

I would say 'much of'. I hadn't seen it until I visited the Americas.

flukus 6 years ago | |

I'd love to see if there's an interesting relationship between these words and the PIE language (https://en.wikipedia.org/wiki/Proto-Indo-European_language)

dspig 6 years ago | | |

I guess not, as this is optimized for defining words, not communicating

peterkelly 6 years ago |

If you like this, you'll definitely enjoy the talk "Growing a Language", by Guy Steele:

https://www.youtube.com/watch?v=_ahvzDzKdB0

nalzok 6 years ago | |

I was waiting for him to share thoughts on the failure (?) of Lisp, which I believe is a decent "shopping mall", but unfortunately he did not :(

dougb5 6 years ago |

This is very interesting -- a kind of topological sort of the dictionary.

It seems like a very natural thing to want to do with subject-specific glossaries as well. Often when I approach a new topic or hobby I want a glossary of all the jargon up front, and I want the words ordered from least to most demanding of in-knowledge.

ErotemeObelus 6 years ago | |

I am a newbie to topology, so please help me. What is a topological sort and why does it describe this?

ww520 6 years ago | | |

The "topological" in topological sort is more related to "network topology" than the mathematical topology (open sets). "Sort" is related to ordering. Topological sort thus is related to the ordering of a network graph's nodes by their edges.

See https://en.wikipedia.org/wiki/Topological_sorting for what it actually is. Topological sort is good at dealing with dependency graphs: it can turn a dependency graph into a linear ordering of nodes.

GP mentioned topological sort because words depend on the other words that define them, and together they form one big directed acyclic graph. Do a topological sort on it and you get a linear list of words ordered by dependency. Group the consecutive words that have no dependency on each other and you get the word layers. Within each layer, none of the words depend on each other. The words in one layer depend on the words in the lower layers.
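To make the layering idea concrete, here is a minimal sketch using Python's standard `graphlib` module (Python 3.9+). The word list and its dependencies are made up for illustration and are not taken from the dictionary itself; the function name `word_layers` is likewise hypothetical.

```python
from graphlib import TopologicalSorter  # standard library, Python 3.9+

# Hypothetical toy data: each word maps to the words its definition uses.
DEFINED_WITH = {
    "see": set(),
    "thing": set(),
    "person": set(),
    "body": {"person", "thing"},
    "hand": {"body"},
    "write": {"hand", "see"},
    "right": {"write", "hand", "body"},
}


def word_layers(defined_with):
    """Group words into layers; each layer uses only words from earlier layers."""
    sorter = TopologicalSorter(defined_with)
    sorter.prepare()  # raises graphlib.CycleError if definitions are circular
    layers = []
    while sorter.is_active():
        ready = sorter.get_ready()  # every word whose dependencies are all done
        layers.append(sorted(ready))
        sorter.done(*ready)
    return layers


if __name__ == "__main__":
    for depth, words in enumerate(word_layers(DEFINED_WITH)):
        print(depth, words)
```

Each round of `get_ready()` plays the role of one layer: the words with no definitions to wait for come first, and a word only appears once everything it is defined with has been placed.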

Robin_Message 6 years ago | | |

Not sure why it's called topological sort or what it has to do with topology, but it's a graph traversal where predecessors are visited before their successors. Such a sort only exists if there are no cycles in the graph.
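The "no cycles" condition is easy to see in practice. A small sketch, again assuming Python's `graphlib` (3.9+) and two invented word graphs; `order_or_cycle` is a hypothetical helper name:

```python
from graphlib import CycleError, TopologicalSorter  # Python 3.9+

# Hypothetical toy graphs: word -> words used in its definition.
ACYCLIC = {"big": set(), "small": {"big"}}
CIRCULAR = {"big": {"small"}, "small": {"big"}}


def order_or_cycle(defined_with):
    """Return (order, None) for an acyclic graph, or (None, cycle) otherwise."""
    try:
        return list(TopologicalSorter(defined_with).static_order()), None
    except CycleError as exc:
        # Per the graphlib docs, args[1] lists the nodes of the detected cycle,
        # with the first node repeated at the end.
        return None, exc.args[1]
```

With `ACYCLIC` the helper returns an ordering in which "big" precedes "small"; with `CIRCULAR` no ordering exists and the detected cycle is returned instead.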

veridies 6 years ago |

I'd like to see research on this. I have an MA in TESOL and teach ESL, so this is very much within my field. While some of what's happening here is basically a self-guided version of classwork, a lot of it seems to rely on very logically precise understandings (defining 'flat' as the shape of unmoving water). I can say from experience that students really struggle with being given a single example like that, even when they know all the words being used; it's just not how most people think. Visual aids and/or multiple examples are pretty essential, and often it requires watching a student to see what's registering and what isn't.

kd5bjo 6 years ago | |

It also falls into the trap of treating the most common definition as the only one. When you look up left/right in here, you'll find positioning, but nothing about liberal/conservative opinions or a legal guarantee (the right to X) or departing (he left the train station).

dejawu 6 years ago |

These are also known as Semantic Primes: https://en.wikipedia.org/wiki/Semantic_primes

phonebucket 6 years ago |

If true, this is interesting from an academic perspective: word meanings can be derived from a space of 60 dimensions. But I’m still not convinced of the value with respect to language learning.

Learning how language is spoken from the fundamental 60 words sounds like trying to learn mathematics from its fundamental axioms. It seems like you might just get caught in a long list of definitions where you might be better off trying to internalise some higher-level useful concepts first.

scarejunba 6 years ago | |

It pushes some things out of the language and into the environment. If humanity went extinct and we had only this dictionary it is unlikely that word meanings would be interpreted to be the same as they are today.

But it's intended as a learning tool and it'll do fine for that.

trox 6 years ago | |

> If true, this is interesting from an academic perspective: word meanings can be derived from a space of 60 dimensions. But I’m still not convinced of the value with respect to language learning.

More like 60 x N, since each of those words can appear arbitrarily many times.

dbmueller 6 years ago | | |

Monoid with 60 generators. But is it free?

tom_mellior 6 years ago | |

> a space of 60 dimensions

I don't think that's true. How would a 60-dimensional vector capture all the syntactic relationships between the words in a complex sentence?

phonebucket 6 years ago | | |

Yeah, I was sloppy and wrong with my wording. Thanks for pointing it out.

I meant a sequence of one-hot encoded vectors of length 60 could make a good model of English.
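A rough sketch of what that encoding looks like, using a five-word stand-in for the 60-word list (the vocabulary and the function names here are hypothetical, chosen only for illustration):

```python
# Hypothetical stand-in for the 60-word list; the real one would have 60 entries.
VOCAB = ["see", "thing", "this", "person", "big"]
INDEX = {word: i for i, word in enumerate(VOCAB)}


def one_hot(word):
    """Return a vector of len(VOCAB) with a single 1 at the word's position."""
    vector = [0] * len(VOCAB)
    vector[INDEX[word]] = 1
    return vector


def encode(words):
    """Encode a phrase as a sequence of one-hot vectors, preserving word order."""
    return [one_hot(word) for word in words]
```

Because the result is a sequence rather than a single vector, word order survives: `encode(["this", "big", "thing"])` and `encode(["big", "this", "thing"])` differ, which a single 60-dimensional vector of word counts could not express.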

grenoire 6 years ago |

Have there been any attempts to 'translate' this into other languages? I'm struggling most often with vocabulary first, whereas the grammar is much easier for me to grasp (programming helps?)

grooomk 6 years ago |

Reminds me of the Natural Semantic Metalanguage approach in linguistics by Anna Wierzbicka, Cliff Goddard and others. Nice to see that there is actually quite some overlap, both in quantity and in the actual words.

david_ar 6 years ago | |

It's not a coincidence: http://learnthesewordsfirst.com/about/research-behind-the-di...

perfunctory 6 years ago |

This reminded me of the minimalist constructed language Toki Pona. As the author herself puts it: "It was my attempt to understand the meaning of life in 120 words."

https://tokipona.org/

https://en.wikipedia.org/wiki/Toki_Pona

harperlee 6 years ago |

Just 60 words plus a huge context shared with the reader through living a human life (so not a lot of hope of feeding this into an algorithm and having it do anything resembling understanding).

Or perhaps with thorough explanations something like this could help bootstrap understanding by a machine?

inetsee 6 years ago |

This reminds me a little of "English Through Pictures" by Richards and Gibson, and "English Made Easy" by Crighton and Koster. Both use images to provide concrete examples for those words that can be illustrated visually.

crazygringo 6 years ago |

It's conceptually interesting, but it also strikes me as a problem that doesn't need solving.

Having worked with and taught foreign languages, I can say that nobody learns the first 1,000 words of a language from a same-language dictionary, nor should they.

Children learn from the world; adults learn from classes or a translating dictionary. (Only intermediate/advanced level learners start to use a native dictionary.)

The idea of "bootstrapping" language knowledge from a single dictionary just... isn't going to be necessary for anyone?

abecedarius 6 years ago | |

I didn't start from zero, but this children's dictionary of French in French was useful to me learning it: https://www.amazon.com/Mon-premier-dictionnaire-Roger-Pillet...

All the words it uses are defined within it -- of course with some circularity, but it's heavy on examples and pictures. It was intended for nonnative children taking classes in a style more like native immersion than is typical in schools. I wish more resources followed this philosophy.

vidarh 6 years ago | |

My first French class did dive into French long before we were at 1,000 words, and we were strictly not allowed to use anything but a same-language dictionary.

We of course did have the benefit of a teacher who would translate if absolutely necessary, but she also insisted on sticking to French in the lessons wherever possible from the very first day. It was far more immersive in my first year of French than e.g. my fourth year of German.

It seemed to work quite well.

kieckerjan 6 years ago |

Tangentially, I have always wondered why compact or pocket dictionaries (the single-language ones) contain the simplest words. If space is of the essence, why not skip the words that everybody knows (like "table" or "shoe") and use the saved space for difficult words? After all, those are the ones you are likely to look up.

mrb 6 years ago |

I suppose this answers https://news.ycombinator.com/item?id=19331307 "What's the minimum number of words you'd need to define all other words?"

kazinator 6 years ago |

There is nothing wrong with circularity in dictionaries, though. This is a solution in search of a problem.

It may be the case that the definition of A happens to use some word B, in whose definition we find word C, whose definition uses A. However, that isn't really a problem, because these definitions are not simply substitutions of exactly one word for another. The definition of A uses numerous words other than B, that of B uses words in addition to C, and that of C uses words in addition to A.

That is, the existence of cycles in definitions doesn't necessarily make the definitions irresolvably circular.

dvh 6 years ago |

ash =

When something burns and becomes many very small dry pieces that moving air can cause to move.

Kind of tree.

taoromera 6 years ago |

What about grammar? You need grammar to form the definitions so you need to teach it at some point.

One approach would be to mix in grammar bits into the word flashcards. Like:

Flashcard 1: word 1
Flashcard 2: word 2
Flashcard 3: grammar bit 1
Flashcard 4: word 3
...

You could use the grammar bits provided by English Profile based on the CEFR (Common European Framework of Reference): https://www.englishprofile.org/english-grammar-profile/egp-o...

bloak 6 years ago |

Could someone add links to all the words in the definitions, linking to the definitions of those words?

I've seen that done for a different language, though in that case the fully connected component had more than 60 words. I think it was more like 120.

Of course it doesn't make sense to do this competitively because it's so unclear what counts as an adequate definition.

I'm not sure this sort of dictionary would help me learn a language: I think probably not much. But it's definitely fun in a philosophical way.

leecarraher 6 years ago |

I once solved a similar problem with a set-theoretic approach, using matroid theory and the greedy method for constructing a matroid basis, with a thesaurus as my independence oracle. This was over a decade ago, so I no longer recall the results, but it certainly seems similar in the goal of finding a primal set of words that can in some way define all others.

murat124 6 years ago |

What a great way to learn a new language. Would pay for a version of this in another language.

waksana 6 years ago |

This is great because:
1. I can review words by learning the next level.
2. I can use English immediately no matter which level I am at, because at any level I have the ability to explain anything.