I am fairly confident that we will see further advances in speech recognition and synthesis to the point certainly where a computer speaking is indistinguishable from a human and hopefully to where a computer can understand what any of us is saying. I don’t mean understand in the sense of comprehension but understanding in the sense of being able to transcribe spoken words into text. I am less confident about the development of Ai to the point where we can hold a conversation with a computer and much less confident now about a computer passing the Turing Test any time soon. But for the purpose of discussion, let's say that we might reasonably expect to be able to dictate a letter to a computer in the next five years with the computer having the ability to ask for clarification of anything it could not transcribe with a high probability of being correct. In other words, we've reached the point where we can all have a personal secretary.
A number of questions arise at this point. The two I'm interested in here are: would we like that and make use of it? And: would we want to go further and develop a thought interface?
There are obvious applications of this sort of technology for the handicapped. I'm fairly sure that Stephen Hawking would be delighted to try out such a system since it has the potential to vastly improve the speed at which he can communicate. But is that right? Would Hawking want to communicate faster than he does now? Perhaps he has adapted to his disability and the time it takes to dictate a letter using his present system allows him time to think and form his thoughts more clearly. My guess is that the majority of people with any kind of speaking disability would like very much to have a machine that could vocalize their thoughts.
For those of us who tend to blurt out the first thing that comes into our heads, it might not be so wonderful to have your thoughts instantly translated into text. "Wow, nice [body-part]"! for example, might be better left as an internal comment rather than one voiced loudly in public. Imagine sitting in a meeting and inadvertently voicing your opinion that the CEO is an idiot. This goes far beyond our inability to control body language - we're going to come right out and say what we're thinking. Personally, this sounds great and I'd like to forcibly apply it to people such as salesmen. Knowing what they are thinking as they try to sell me a car would definitely help my side of the bargaining process.
So let's add a button. You push the button when you want your thoughts translated and the computer picks them up over WiFi (or perhaps Bluetooth if we still have that in five years). That takes care of the gross blunders but does this provide a better interface? For writing a letter, I doubt that it does unless it can question my grammar and make helpful (but not annoying) suggestions for improvements. I'd like it to take dictation and then act as an interactive editor. That sounds pretty good to me. But this is all fairly pedestrian. We are getting pretty close now to being able to dictate with good accuracy and although grammar checkers still need improving, they are way ahead of me.
Instead of thinking about text, let's push things out a bit and think about more abstract things like art. Art is a physical expression of some sort of personal image (in many cases). The artist imagines the outcome and renders it with some physical medium (I'm including computer animation as being a physical tool). But what if you could render a dream? What if I could send you one of my dreams and you could play it back? That raises a lot of questions about dreams and their context. Do you have the right background experiences to feel the true terror I might experience from being in a very particular closed space? Probably not. But it's interesting to think (no pun intended) that we might exchange thoughts in some way.
When Microsoft first produced a speech recognition engine, I tried it out at work. I thought it would be fun to be able to tell the computer to open and close files and do a variety of other mundane things. The results we horrifying. All sorts of misinterpretations occurred with the result that I was terrified to speak at all. Who knew what file sit was moving or deleting? After a bit more tinkering it was just plain funny. We tried it on lots of people in the office with similar results. Unless you had a strong Texan accent, it had no clue and performed some apparently random act. Now let's extrapolate that experience to a beta copy of "Dream Sculptor". I'm assuming by this time we don’t need to get wired up to do thought input to the machine. I'll assume it's got some very sensitive electromagnetic sensors that can produce very accurate 3D data of what impulses are going on inside my head. So we get close to the machine and push the big red "Think Command" button. I'm pretty sure that exactly at that point one of two things will happen. Behind door 'A' is a big, blank, empty space - no thoughts at all. Behind door 'B' is some completely random thought that most definitely does not need to get inserted into my dream sculpting program. Even as I'm typing this (slowly) my mind is wandering off thinking about all sorts of random things that would somehow appear in the output. And even if I do get to craft a decent dream sequence, can I edit it? Can I swap the face of someone I know for a celebrity? I'd like to think so but undoubtedly this won't work. The human mind is far too complex and (mine at least) thoughts are far too obscure to generate any kind of coherent image.
Work has been going on for long time (certainly since the 70's) on converting nerve impulses into mechanical actions. The driving concept being the production of better prosthetic limbs. This has turned out to be more complex that was originally thought. It's not at all like turning on a light with a switch on the wall. Getting people to initiate nerve impulses for missing body parts takes a lot of practice and it takes a lot of signal processing to accurately tell what the intent was. Imagine how hard it is to do the same thing on the scale of ten of millions of firing neurons. For every pattern we might be able to detect at least part of the time there will be millions of patterns that are similar but which have quite different meaning. So while we might be able to develop a system to translate some very simple thoughts into turning a light on or off, I have great doubts that we can ever develop a thinking interface to an electronic machine. I suspect that a more productive path lies in transferring thoughts to some sort of biological machine but since I have trouble expressing my thoughts to my loved ones who have had plenty of experience in interpreting them, I am not optimistic that something grown in a Petri dish will fare any better.
Nigel