Is being bilingual only about the lexicon?

To start, each system constructs bilingual dictionaries without the aid of a human teacher telling it when its guesses are right. That’s possible because languages have strong similarities in the ways words cluster around one another. The words for table and chair, for example, are frequently used together in many languages.
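If I sketch the mechanism as I understand it, each language gets its own word-vector space built from those co-occurrence statistics, and the trick is finding a rotation that maps one space onto the other. A toy version in Python (the actual papers bootstrap the pairing without supervision, via adversarial training; the Procrustes refinement below assumes a rough pairing already exists, and all the data here is synthetic):

    import numpy as np

    # Two synthetic "languages": Y is X under a hidden rotation plus
    # noise, mimicking the claim that words cluster the same way in
    # both. Real systems would use word2vec/fastText vectors instead.
    rng = np.random.default_rng(0)
    n_words, dim = 1000, 50
    X = rng.normal(size=(n_words, dim))
    hidden = np.linalg.qr(rng.normal(size=(dim, dim)))[0]
    Y = X @ hidden + 0.01 * rng.normal(size=(n_words, dim))

    # Orthogonal Procrustes: the rotation W minimizing ||XW - Y||,
    # used in the papers as a refinement once pairs have been guessed.
    U, _, Vt = np.linalg.svd(X.T @ Y)
    W = U @ Vt

    # "Translate" word 0: rotate it into the other space and take the
    # nearest neighbor. The matching word is recovered.
    query = X[0] @ W
    print(np.argmin(np.linalg.norm(Y - query, axis=1)))  # -> 0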

So how does the computer know that table and chair are often used together? What about cultures that do not have chairs, but do have tables? How is the computer mapping co-occurrences that vary significantly by culture, or that simply do not exist?

This article uses Chinese and Arabic as the example languages for mapping. The underlying cultural principles, for community, behaviors, constructs, are rather different, and since these are mapped IN language, does one become more like the other? Does the machine create an Arabic with Chinese sentiments? (The papers themselves use French and English, which are much more similar, culturally and linguistically.)

Treating language like math is not going to turn out well, though I haven’t yet read the papers to see how they support these assumptions.

Back to my regularly repeated statement: machine translation, and language technology generally, that does not address the significant cultural components of language, as communication, as culture, as a transfer mode of ideology, will, in the end, create a different or new culture. Now would be a very good time to be paying more attention to this.

http://www.sciencemag.org/news/2017/11/artificial-intelligence-goes-bilingual-without-dictionary


An AI is an AI is an AI…or not.

Every morning, of late, when I read the news, there is a slew of headlines about what AI has done for us lately.

Just this morning, I read:

Robert Wickham of Salesforce is the source of the last statement, that AI will be the new electricity once we are done oohing and ahhing, or being afraid that we will all lose our jobs.

AI, however, is not like electricity. It is not so straightforward. While it may, eventually, be ubiquitous and unconsidered, so far we cannot provide a single, clear definition of what it is, and thus these reductive metaphors create greater confusion than clarity.

In each article, ‘AI’ describes something different: deep learning, neural networks, robotics, hardware, some combination, etc. Even within deep learning or neural networks, the meanings can differ, as can the nuts and bolts. Most media and humans use ‘AI’ as shorthand for whatever suits their context. AI gets by without an agreed-upon definition, but the lack of clarity, differentiation, and understanding makes it very difficult to discuss in a nuanced manner.

There is code, there is data, there is an interface for inputs and outputs, and all of these are (likely) different in each instantiation. Most of the guts are proprietary, in the combination of code and data and training, so we don’t necessarily know what makes up the artificial intelligence.

Even ‘code’, as shorthand to a layperson for the stuff that makes computers do what they do, is a broad and differentiated category. In this case, like language, it is used for a particular purpose, so this reduction is perhaps not as dangerous. We’ve never argued that code is going to take over the world, or that rogue code is creating disasters, as we did with algorithms a few years ago, and with AI now.

So how much of this lumping is a problem? We lump things, such as humans or cats, into categories based on like attributes, but we do have some ways to differentiate them, even if these may not be useful categories: nationality, breed, color, behavior, gender. (Even these are pretty fraught of late, so perhaps our categorization schemes for mammals need some readdressing.) On the other side, we could consider cancer, an incredibly reductive title for a very broad range of… well, of what? Tumor types? Mechanisms? Genetic predispositions? There are discussions, given recent research, as to whether cancer should be a noun; perhaps it is better served as a verb. Our bodies cancer, I am cancering, to show the current activity of internal cellular misbehavior.

What if we consider this on the intelligence side: how do we speak of intelligence, artificial or otherwise? For intelligence, like consciousness, in humans, we do not have clear explanations of what it is or how it works. So it is perhaps not the simplest domain to borrow language from and apply to machines. Neural networks are one aspect, modeled on human brains, but the model is limited to the structural pathways, a descriptor of how information travels and is stored.

The choice to use AI to represent such a broad range of concepts, behaviors, and functions concerns me. Even in the set of headlines above, it is difficult to know what is being talked about, from a continuum of visible outputs to robots who speak. If we cannot be more clear about what we are discussing, it is incredibly complicated to make clear decisions about functions, inputs, outputs, data, biases, ethics, and all the things which have broad impacts on society.

I haven’t seen clear work on how we should use this language, and though I worked with IBM Watson for a while on exactly this concern, I can’t say I have a strong recommendation for how we categorize not just what we have now, but, as importantly, what is being built and what will exist in the future. The near future.

I’ll work on this later, as in soon: ways to talk about these parts in a public context that are clearer and that allow for the growth of creations into a systems model. Check back!


Grammar checking software, language change, and precursors to the future

Almost 20 years ago I wrote my master’s thesis on ‘Grammar Checking Software as a Tool of Language Change’, or some such title. I’ve since lost track of it; it was, in that day, on paper.

I was studying at Georgetown University, and my work focused on language and power from a sociolinguistic and cultural perspective. I had been using assorted early-90s log files I had collected from IRC and from a forum used at UCSC when I was an undergrad, assessing markers of power in language in the online environment, watching the evolution of language change, and seeing the ways in which one positioned oneself via language. One of the areas that particularly interested me, and which I delved further into, was how non-native speakers of English marked authority online in an anonymous environment, using a language that was rapidly evolving and often different in each community.

This work led me to the early grammar and spelling checkers, and to my recurring curiosity as to why they were so ungrammatical. I decided to analyze the grammaticality of the then-current crop of tools against Fowler’s Modern English Usage. For the non-grammar nerds, Fowler’s is an early 20th-century grammar text which is/was the go-to for proper English. As a comparator it had its issues, but those I worked around and wrote about. As I worked through analyzing different software programs, I eventually opted to focus on Microsoft Word alone, due to the enormity of the task.

The outcome was that even at its most stringent setting, the grammar checker was nowhere near Fowler’s. And at its more casual levels (at the time it had three), the English it recommended was so odd and so lax that using the tool would, in my estimation, ‘teach’ a user a very different English than one would learn at school.

There was no way this was not purposeful, which led me to many questions, most of which remained unanswered, as MS did not wish to speak to me about them. At the most basic, I was curious who wrote the program, because I felt that no linguist would have built a grammatically incorrect system. This led me to the hypothesis that the linguists built the systems and the marketers freaked out at all the wavy green and red lines and insisted it have fewer. I would love to have heard more of the inside story, and if you happened to work on this, I’d love to hear from you.

This has really interesting ramifications for both Americans and non-native English speakers. In general, and in different ways, each can use a boost in grammar, and if you have an authoritative tool telling you that no, you cannot use ‘which’ referring to a cat, it must be ‘that’, then this is a possible shift. It refused the passive voice, and other constructs it deemed overly complex. One could not use a run-on sentence nor a fragment. It flattened, significantly, the ability to be creative in language.
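To make concrete what that flavor of checking looks like, here is a toy illustration, mine and not Microsoft’s (their rules were proprietary, and a real checker parses sentences rather than pattern-matching them), of how a rule-based checker flags, and thereby discourages, constructions:

    import re

    # Toy rules in the spirit of 1990s grammar checkers; illustrative
    # only, with all the false positives that bare regexes imply.
    RULES = [
        (re.compile(r"\b(is|are|was|were|been|being)\s+\w+ed\b", re.I),
         "possible passive voice; consider rewriting in active voice"),
        (re.compile(r"(?<!,)\s+which\b"),
         "restrictive clause: consider 'that' instead of 'which'"),
    ]

    def check(sentence):
        # Return the advice triggered by each matching rule.
        return [msg for pattern, msg in RULES if pattern.search(sentence)]

    for s in ["The report was reviewed by the cat.",
              "The cat which reviewed the report purred."]:
        print(s, "->", check(s))

Every flag of this sort nudges the writer toward one English and away from another.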

We could say that one could turn this off, or that it could be ignored, but in the course of doing the research I did eventually become bothered by all the wavy lines and wanted to change words to adhere to what I called Microsoft English. Because it was, in fact, a significantly different and evolved English. One that was being rapidly disseminated by what was becoming the most widely used word processor.

Whether or not Microsoft was attempting to create a new English, or was aware of the possible cultural ramifications, and power structures, that it was creating and/or reinforcing, changes were happening.

And this was in one language. Years later I went back and re-ran some of my assessments in French, out of curiosity about formality/informality and what the software would recommend, and even in a language which has an Académie to control it, there were shifts away from the rulings. So perhaps in most languages we see changes due to the judgments of software, and these likely flow into society, because we do learn from software.

I doubt most people have considered this: that the language promoted by their grammar checking software is ‘wrong’, or at the very least was not a standard until the software created a standard. I am not writing this, nor did I at the time, to be a stickler for Fowler’s or old grammar rules, but to surface the awareness that changes in the systems’ use of language flow outward into spoken language, and that they often have significant ramifications for how we can think about things. If your software systematically attempts to remove the use of animate pronouns for animals, plants, and objects, it becomes a judgment on the humanity of any thing other than a human. These are the things invisible to most people, yet significant in the ways in which we exist in our worlds.

In the same vein, I still look at the ways in which software output flows into our language: not just how language changes within power and control structures, but also the adoption of words, and the modification of our grammar, to make it easier to interact with the machines. More on this last bit later.


Temporal translation

I collect old dictionaries, in many languages, translation dictionaries and otherwise. They are full of rich cultural information, new words, pathways, changed meanings, and I enjoy reading them for the glimpses of other worlds.

Often they contain words that I have to look up in other dictionaries. My copy of the first Hebrew-English translation dictionary released in Israel, for example, has so many words about the desert, about the plants, water, formations, growing, that I had to look a significant number of them up in English, as I had never heard them. A more modern Hebrew-French translation dictionary I have does not include nearly as many words of this sort.

I can build these models in my mind, in bits and pieces. But what would it be like to build them in the machine, to provide a rich view into different time periods by pouring in time-specific language data?

What if the machine can translate me to 1700s English? What if the machine translated from time periods, different Englishes, or Frenches? What about dialects?
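As a very rough sketch of the raw ingredient such a system would need: slice a corpus by period and train one model per slice, then align the spaces, the same rotation trick the bilingual papers use, to follow a word across centuries. Everything here is hypothetical, the corpus is made up, and I am assuming gensim’s Word2Vec as the trainer:

    from collections import defaultdict
    from gensim.models import Word2Vec

    # Hypothetical (year, tokenized sentence) pairs; a real corpus
    # would have millions of these per period.
    corpus = [
        (1711, ["the", "spectator", "doth", "observe"]),
        (2017, ["the", "observer", "does", "watch"]),
    ]

    by_period = defaultdict(list)
    for year, tokens in corpus:
        by_period[year // 100 * 100].append(tokens)  # bucket by century

    # One embedding space per century; aligning them (e.g., with the
    # Procrustes step sketched earlier) would let a word's neighbors
    # be compared across periods.
    models = {
        period: Word2Vec(sentences, vector_size=50, min_count=1,
                         workers=1, seed=0)
        for period, sentences in by_period.items()
    }
    print(sorted(models))  # -> [1700, 2000]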

I don’t know where phonology data would come from. What if I want to translate to Beowulf? How does the machine learn to pronounce the language properly?

I can imagine an amazing visualization, a timeline that I can drag into the past to hear the sounds. Except it would need regional variation as well.

In the tradition of vac, in which sound matters, the sound and the meaning intricately entwined, what histories can we learn by having the ability to translate to other places in time, not just other languages?

Do men dream of electric families?

In two recent articles I’ve read, men have created robots modeled after female family members.

Martine Rothblatt created BINA48 in the image of his wife, Bina Aspen Rothblatt, and Hiroshi Ishiguro created his in the image of his (then) five-year-old daughter.

Why?

Sources:

  • https://futurism.com/the-most-life-life-robots-ever-created/
  • https://www.wired.com/2017/10/hiroshi-ishiguro-when-robots-act-just-like-humans/


Neural Machine Translation architecture

Almost all such systems are built for a single language pair — so far there has not been a sufficiently simple and efficient way to handle multiple language pairs using a single model without making significant changes to the basic NMT architecture.

Google’s engineers working on NMT released a paper last year detailing a solution for multilingual NMT that avoids significant changes to the basic architecture, which was designed around single-language-pair translation.
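As I understand the paper, the solution is strikingly small: prepend an artificial token to the source sentence naming the desired output language, and leave the architecture alone. A sketch of that preprocessing step (the token format follows the paper; the sentences are made up):

    def add_target_token(source: str, target_lang: str) -> str:
        # An artificial token at the start of the source tells the
        # otherwise-standard, shared NMT model which language to emit.
        return f"<2{target_lang}> {source}"

    # All language pairs can then share one model and one training set:
    print(add_target_token("How are you?", "es"))      # <2es> How are you?
    print(add_target_token("Comment vas-tu ?", "en"))  # <2en> Comment vas-tu ?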

This makes me wonder what an NMT architecture optimized for multilinguality would look like, how it would differ from what is currently in place, and what the differences would be in how it behaves, if any.

I wonder whether the system would be different if it were architected in a language other than English. What do you get if you cross Sapir-Whorf with systems architecture?

I wonder how the machine would translate ‘soy milk’ to Romanian. Would it assume English is the source language because of ‘milk’?


Luc Steels and language evolution models in robots

Luc Steels’ work from more than a decade ago on the evolution of language is one of the few examples of someone thinking about how robots could evolve language. He looks at the evolution of language using agents/AIs as the means of exploring how languages are learned and evolve.

What I am interested in is different from this. I am interested in how machines evolve language to communicate with each other, what this means for how humans understand machines, and what the communication between the two will be in the future. So AI/AI conversations as well as AI/AI/human. I prefer the triad because it is important to my hypotheses that the machines interrelate with each other as well as with humans.

To go back for a moment to the evolution of language, think of it this way. There are the origins of language, the means by which children learn a language, and the ways in which a language evolves. For the latter, take a teenager, whose language may well be incomprehensible to adults. You can see linguistic variation both in the meanings of words and in the grammatical structures. We don’t question this in teens, though we do usually expect them to speak our languages as well, to control, as a linguist would say, the continuum of variations. Now imagine two AIs as teenagers: I want to understand the ways in which they evolve language in order to communicate, and what drives both the evolution of the parts (words, grammar) and the communication needs that lead to changes in language.

Thus: how do we create models of language evolution that machines may adopt if they are allowed to change language as they see fit, for whatever reason? Mostly, now, we discuss this in terms of efficiency, but that is a very human, deterministic view that I prefer to avoid at this time. Some of the current, and unfortunately very small, data sets I have seen on AI language evolution show markers similar to early creole language models, and I’d like to see more data to understand whether this is what is happening.

But back to Steels, and to one of his talks I quite enjoy, given at Aldebaran in 2005. He explains what matters to his work, and the ways in which he is modeling the past.

One of his most interesting points is that embodiment is required for the evolution that he is interested in. He is attempting to model the evolution of human language, and without embodiment, it doesn’t work.

This is also very interesting to consider in the current AI/agents that are being created, and how gestures may change the ways AI and humans will and can communicate.  I haven’t yet seen much written about this, but I also haven’t explicitly looked for papers and research on language evolution and embodiment in the current collections of intelligences being designed and built.

Origins are difficult in linguistics. We don’t really know where and how languages originated in humans, or why. We don’t have a clear understanding of how languages work in our minds, or how languages are learned. We can argue different positions, and there is an enormous body of work and theory on this, but I wouldn’t say there is agreement. However, while people do study this, it isn’t on par with the exploration of the origins of the universe, for example. We lack a clear and precise model of behavior, language, and intelligence, so when we are building machines that engage in these domains, I might argue that we can’t be sure of the outcome. In effect, unlike the game of Go, there are no first principles we can give a machine.

But, as Steels points out, asking these questions about origins can provide us with profound insights. And this loops back to his work on the evolution of language, and how he is going about trying to address these questions.

Here is the link to the video, Can Robots Invent Their Own Language. It’s worth a watch if you are interested in these fields.


AlphaGo and human culture

Many of the articles note that the machines are making moves that humans have never made.  Both in the original, AlphaGo Lee, and in the evolved, AlphaGo Zero, we see games that “no human has ever played.”

So many questions:

  1. How do we know no human has ever played it?
  2. Is the cultural ritual surrounding Go such that, as one learns, there is an expectation of adherence to tradition from which a human would not diverge?
  3. How do aesthetics play into the success of the machines? (I remember from learning Go decades ago that this mattered, but have read nothing about the aesthetics of the machines’ games. Caveat: I haven’t played in 20 years so what do I know?)
  4. Is there any difference between the first principles given to the machines and those given to humans?
  5. Are the games played by the machines admired by the top humans?
  6. Is it expected that humans are learning from the machines and that human/human games will now be played differently?

No humans involved! (Except where they were.)

The headlines proclaiming that no humans were involved in AlphaGo Zero’s mastery continue to amuse me. Now if a machine had taught AGZ the principles and set it off on its path, that would be really something.

The press’ continued erasure of the humans who built the machines and provided the first principles from which to learn the game is indicative of a larger issue: we don’t see the humans behind the decisions being made, and thus have no insight into bias, etc.

The linked headline above at least specifies that humans were not involved in the mastery, rather than the creation. AGZ played significantly more games than AGL, though this is not often mentioned either. If you harken back to Gladwell’s supposition that it takes 10,000 hours of practice to become an expert, and we are looking at machines playing from 100,000 to 4 million games to ‘learn’ to excel, it is not surprising that they are outplaying humans. (At even an hour a game, a million games is over a century of nonstop play.) We simply do not have the capacity (or longevity, and likely desire) to play so many games.

The description below, of allowing AGZ to have access to all past experiences, which AGL did not have except when playing humans, is very interesting, and I’d love to know more about this decision and why it was taken.

AlphaGo Zero’s creators at Google DeepMind designed the computer program to use a tactic during practice games that AlphaGo Lee didn’t have access to. For each turn, AlphaGo Zero drew on its past experience to predict the most likely ways the rest of the game could play out, judge which player would win in each scenario and choose its move accordingly.


AlphaGo Lee used this kind of forethought in matches against other players, but not during practice games. AlphaGo Zero’s ability to imagine and assess possible futures during training “allowed it to train faster, but also become a better player in the end,” explains Singh, whose commentary on the study appears in the same issue of Nature.
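A toy rendering of that ‘imagine and assess possible futures’ loop, on a trivial take-away game rather than Go, and with random rollouts standing in for AGZ’s learned network and real Monte Carlo tree search:

    import random

    # Misere Nim: take 1-3 counters; whoever takes the last one loses.

    def legal_moves(pile):
        return [m for m in (1, 2, 3) if m <= pile]

    def rollout(pile, player):
        # Play out the rest of the game randomly; return the winner.
        while pile > 0:
            pile -= random.choice(legal_moves(pile))
            player = 1 - player
        return player  # the player NOT stuck taking the last counter

    def choose_move(pile, player, n_sims=500):
        # Lookahead during (self-)play: estimate each move's win rate
        # by simulating possible futures, pick the best-scoring move.
        best, best_rate = None, -1.0
        for m in legal_moves(pile):
            wins = sum(rollout(pile - m, 1 - player) == player
                       for _ in range(n_sims))
            if wins / n_sims > best_rate:
                best, best_rate = m, wins / n_sims
        return best

    print(choose_move(pile=10, player=0))

The real system replaces those random rollouts with predictions from its network, which is, as I read it, the ‘drew on its past experience’ part of the quote above.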