Connectionists: The symbolist quagmire

Gary Marcus gary.marcus at nyu.edu
Sat Jun 18 10:00:23 EDT 2022


You have to remember that 

a. Programming is both conceptual and line-by-line; Codex is (somewhat) good at the line-by-line stuff, not the conceptual stuff.

b. It’s still quite far from reliable; it can make suggestions, but, at least for now, you absolutely need a human in the loop. You can’t specify your video game in English and expect it to work unless (maybe) it is very close to some kind of library example.

c. I would also caution that there will be many systems of this sort, and that the best will most likely be hybrids, but that we won’t know from the outside exactly what is going on, which makes it hard for us to derive direct lessons from the performance of black boxes that may in fact incorporate some symbolic mechanisms inside the box. (Google Search is a hybrid, based on public disclosures, but we don’t know the details, in terms of how much is symbolic, how much is “neural”, how the two are integrated, etc.; the best automatic programming aids will be similar.)

> On Jun 17, 2022, at 08:46, Mitsu Hadeishi <mitsu at well.com> wrote:
> 
> 
> What do you make of the fact that GPT-3 can be trained to code fairly complex examples? For instance, I read that one person described a relatively involved browser video game in plain English and Codex (a coding-optimized version of GPT-3) generated a relatively large amount of JavaScript that correctly solved the problem: the code actually runs and produces an interactive game that runs in a browser.
> 
> Although its generalization of arithmetic is apparently somewhat fuzzy, it seems to me that being able to accomplish something like this is pretty strong evidence it is able to do some level of variable binding and symbolic manipulation in some sense.
> 
>> On Thu, Jun 16, 2022 at 11:42 PM Gary Marcus <gary.marcus at nyu.edu> wrote:
>> My own view is that arguments around symbols per se are not very productive, and that the more interesting questions center around what you *do* with symbols once you have them.
>> If you take symbols to be patterns of information that stand for other things, like ASCII encodings, or individual bits for features (e.g. On or Off for a thermostat state), then practically every computational model anywhere on the spectrum makes use of symbols. For example the inputs and outputs (perhaps after a winner-take-all operation or somesuch) of typical neural networks are symbols in this sense, standing for things like individual words, characters, directions on a joystick etc. 
>> In The Algebraic Mind, where I discussed such matters, I said that the interesting difference was really in whether a given system had operations over variables, such as those you find in algebra or lines of computer programming code, in which there are variables, bindings, and operations (such as storage, retrieval, concatenation, addition, etc.).
>> Simple multilayer perceptrons with distributed representations (with some caveats) don’t implement those operations (“rules”) and so represent a genuine alternative to the standard symbol-manipulation paradigm, even though they may have symbols on their inputs and outputs.
>> But I also argued that (at least with respect to modeling human cognition) this was to their detriment, because it kept them from freely generalizing many relations (universally quantified one-to-one mappings, such as the identity function, given certain caveats) as humans would. Essentially the point I was making in 2001 is what would nowadays be called distribution shift; the argument was that operations over variables allowed for free generalization.
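
The contrast in the paragraph above can be illustrated with a toy sketch. Everything in it is a hypothetical illustration rather than anything from The Algebraic Mind: `identity_rule` stands for an operation over a variable, and `LookupLearner` is a deliberately crude memorizer that is only a stand-in for a system with no such operation. It is not a model of a multilayer perceptron.

```python
# Toy illustration (assumed names): free generalization of the identity
# function by a rule over a variable, versus a memorizer that only knows
# the input/output pairs it was trained on.

def identity_rule(x):
    """Operation over a variable: x is bound to any input and returned."""
    return x


class LookupLearner:
    """Memorizes training pairs; has no variable to bind unseen inputs to."""

    def __init__(self):
        self.table = {}

    def train(self, pairs):
        for inp, out in pairs:
            self.table[inp] = out

    def predict(self, inp):
        # Unseen inputs have no stored answer, so this returns None.
        return self.table.get(inp)


# Train only on even numbers below 10, then probe an odd number.
train_pairs = [(n, n) for n in range(0, 10, 2)]
learner = LookupLearner()
learner.train(train_pairs)

seen, unseen = 4, 7
print(identity_rule(seen), learner.predict(seen))
print(identity_rule(unseen), learner.predict(unseen))
```

The rule answers correctly for the held-out input because it never depended on which values appeared in training; the memorizer has nothing to say outside its training set. Real networks sit somewhere between these two extremes, which is what the empirical work mentioned in this thread tries to measure.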
>> Transformers are interesting; I don’t fully understand them. Chris Olah has done some interesting relevant work I have been meaning to dive into. They do some quasi-variable-binding-like things, but still empirically have trouble generalizing arithmetic beyond training examples, as Razeghi et al. showed on arXiv earlier this year. Still, the distinction between models like multilayer perceptrons that lack operations over variables and computer programming languages that take them for granted is crisp, and I think a better start than arguing over symbols, when no serious alternative to having at least some symbols in the loop has ever been proposed.
>> Side note: Geoff Hinton has said here that he doesn’t like arbitrary symbols; symbols don’t have to be arbitrary, even though they often are. There are probably some interesting ideas to be developed around non-arbitrary symbols and how they could be of value.
>> Gary
>> 
>> 
>> 
>>>> On Jun 15, 2022, at 06:48, Stephen Jose Hanson <stephen.jose.hanson at rutgers.edu> wrote:
>>>> 
>>> 
>>> Here's a slightly better version of SYMBOL definition from the 1980s, 
>>> 
>>> 
>>> 
>>> (1) a set of arbitrary physical tokens (scratches on paper, holes on a
>>> tape, events in a digital computer, etc.) that are (2) manipulated on
>>> the basis of explicit rules that are (3) likewise physical tokens and
>>> strings of tokens. The rule-governed symbol-token manipulation is
>>> based (4) purely on the shape of the symbol tokens (not their “meaning”)
>>> i.e., it is purely syntactic, and consists of (5) rulefully combining
>>> and recombining symbol tokens. There are (6) primitive atomic symbol
>>> tokens and (7) composite symbol-token strings. The entire system
>>> and all its parts—the atomic tokens, the composite tokens, the syntactic
>>> manipulations (both actual and possible) and the rules—are all
>>> (8) semantically interpretable: The syntax can be systematically assigned
>>> a meaning (e.g., as standing for objects, as describing states of affairs).
>>> 
>>> 
>>> 
>>> A critical point for learning: as this definition implies, a key element in the acquisition of symbolic structure involves a type of independence between the task the symbols are found in and the vocabulary they represent. Fundamental to this type of independence is the ability of the learning system to factor the generic nature (or rules) of the task from the symbols, which are arbitrarily bound to the external referents of the task.
>>> 
>>> 
>>> 
>>> Now it may be the case that a DL model doing classification is doing categorization, or concept learning in the sense of human concept learning, or maybe not. Symbol manipulation may or may not have much to do with this.
>>> 
>>> 
>>> 
>>> This is why, I believe, Bengio is focused on this kind of issue, since there is a likely disconnect.
>>> 
>>> 
>>> 
>>> Steve
>>> 
>>> 
>>> 
>>>> On 6/15/22 6:41 AM, Velde, Frank van der (UT-BMS) wrote:
>>>> Dear all.  
>>>>  
>>>> It is indeed important to have an understanding of the term 'symbol'. 
>>>>  
>>>> I believe Newell, who was a strong advocate of symbolic cognition, gave a clear description of what a symbol is in his 'Unified Theories of Cognition' (1990, pp. 72-80):  
>>>> “The symbol token is the device in the medium that determines where to go outside the local region to obtain more structure. The process has two phases: first, the opening of access to the distal structure that is needed; and second, the retrieval (transport) of that structure from its distal location to the local site, so it can actually affect the processing.” (p. 74). 
>>>>  
>>>> This description fits with the idea that symbolic cognition relies on Von Neumann-like architectures (e.g., Newell, Fodor and Pylyshyn, 1988). A symbol is then a code that can be stored in, e.g., registers and transported to other sites.  
>>>>  
>>>> Viewed in this way, a 'grandmother neuron' would not be a symbol, because it cannot be used as information that can be transported to other sites as described by Newell.  
>>>>  
>>>> Symbols in the brain would require neural codes that can be stored somewhere and transported to other sites. These could perhaps be sequences of spikes or patterns of activation over sets of neurons. The questions then remain how these codes could be stored in such a way that they can be transported, and what the underlying neural architecture to do this would be.  
>>>>  
>>>> For what it is worth, one can have compositional neural cognition (language) without relying on symbols. In fact, not using symbols generates testable predictions about brain dynamics (http://arxiv.org/abs/2206.01725). 
>>>>  
>>>> Best,  
>>>> Frank van der Velde 
>>>> 
>>>> From: Connectionists <connectionists-bounces at mailman.srv.cs.cmu.edu> on behalf of Christos Dimitrakakis <christos.dimitrakakis at gmail.com>
>>>> Sent: Wednesday, June 15, 2022 9:34 AM
>>>> Cc: Connectionists List <connectionists at cs.cmu.edu>
>>>> Subject: Re: Connectionists: The symbolist quagmire
>>>>  
>>>> I am quite reluctant to post something, but here goes.
>>>> 
>>>> What does a 'symbol' signify? What separates it from what is not a symbol? Is the output of a deterministic classifier not a type of symbol? If not, what is the difference?
>>>> 
>>>> I can understand the label symbolic applied to certain types of methods when applied to variables with a clearly defined conceptual meaning. In that context, a probabilistic graphical model on a small number of variables (e.g., the classical smoking, asbestos, cancer example) would certainly be symbolic, even though the logic and inference are probabilistic.
>>>> 
>>>> However, since nothing changes in the algorithm when we change the nature of the variables, I fail to see the point in making a distinction.
>>>> 
>>>> On Wed, Jun 15, 2022, 08:06 Ali Minai <minaiaa at gmail.com> wrote:
>>>> Hi Asim
>>>> 
>>>> That's great. Each blink is a data point, but what does the brain do with it? Calculate gradients across layers and use minibatches? The data point is gone instantly, never to be iterated over, except any part that the hippocampus may have grabbed as an episodic memory and can make available for later replay. We need to understand how this works and how it can be instantiated in learning algorithms. To be fair, in the special case of (early) vision, I think we have a pretty reasonable idea. It's more interesting to think of why we can figure out how to do fairly complicated things of diverse modalities after watching someone do them once - or never. That integrated understanding of the world and the ability to exploit it opportunistically and pervasively is the thing that makes an animal intelligent. Are we heading that way, or are we focusing too much on a few very specific problems? I really think that the best AI work in the long term will come from those who work with robots that experience the world in an integrated way. Maybe multi-modal learning will get us part of the way there, but not if it needs so much training.
>>>> 
>>>> Anyway, I know that many people are already thinking about these things and trying to address them, so let's see where things go. Thanks for the stimulating discussion.
>>>> 
>>>> Best
>>>> Ali
>>>> 
>>>> 
>>>> 
>>>> Ali A. Minai, Ph.D.
>>>> Professor and Graduate Program Director
>>>> Complex Adaptive Systems Lab
>>>> Department of Electrical Engineering & Computer Science
>>>> 828 Rhodes Hall
>>>> University of Cincinnati
>>>> Cincinnati, OH 45221-0030
>>>> 
>>>> Phone: (513) 556-4783
>>>> Fax: (513) 556-7326
>>>> Email: Ali.Minai at uc.edu
>>>>           minaiaa at gmail.com
>>>> 
>>>> WWW: https://eecs.ceas.uc.edu/~aminai/
>>>> 
>>>> 
>>>> On Tue, Jun 14, 2022 at 7:10 PM Asim Roy <ASIM.ROY at asu.edu> wrote:
>>>> Hi Ali,
>>>> 
>>>>  
>>>> 
>>>> Of course the development phase is mostly unsupervised and I know there is ongoing work in that area that I don’t keep up with.
>>>> 
>>>>  
>>>> 
>>>> On the large amount of data required to train the deep learning models:
>>>> 
>>>>  
>>>> 
>>>> I spent my sabbatical in 1991 with David Rumelhart and Bernie Widrow at Stanford. And Bernie and I became quite close after attending his class that quarter. I usually used to walk back with Bernie after his class. One day I did ask where does all this data come from to train the brain? His reply was - every blink of the eye generates a datapoint.
>>>> 
>>>>  
>>>> 
>>>> Best,
>>>> 
>>>> Asim
>>>> 
>>>>  
>>>> 
>>>> From: Ali Minai <minaiaa at gmail.com> 
>>>> Sent: Tuesday, June 14, 2022 3:43 PM
>>>> To: Asim Roy <ASIM.ROY at asu.edu>
>>>> Cc: Connectionists List <connectionists at cs.cmu.edu>; Gary Marcus <gary.marcus at nyu.edu>; Geoffrey Hinton <geoffrey.hinton at gmail.com>; Yoshua Bengio <yoshua.bengio at mila.quebec>
>>>> Subject: Re: Connectionists: The symbolist quagmire
>>>> 
>>>>  
>>>> 
>>>> Hi Asim
>>>> 
>>>>  
>>>> 
>>>> I have no issue with neurons or groups of neurons tuned to concepts. Clearly, abstract concepts and the equivalent of symbolic computation are represented somehow. Amodal representations have also been known for a long time. As someone who has worked on the hippocampus and models of thought for a long time, I don't need much convincing on that. The issue is how a self-organizing complex system like the brain comes by these representations. I think it does so by building on the substrate of inductive biases - priors - configured by evolution and a developmental learning process. We just try to cram everything into neural learning, which is a main cause of the "problems" associated with deep learning. They're problems only if you're trying to attain general intelligence of the natural kind, perhaps not so much for applications.
>>>> 
>>>>  
>>>> 
>>>> Of course you have to start simple, but, so far, I have not seen any simple model truly scale up to the real world without: a) Major tinkering with its original principles; b) Lots of data and training; and c) Still being focused on a narrow task. When this approach shows us how to build an AI that can walk, chew gum, do math, and understand a poem using a single brain, then we'll have something like real human-level AI. Heck, if it can just spin a web in an appropriate place, hide in wait for prey, and make sure it eats its mate only after sex, I would even consider that intelligent :-).
>>>> 
>>>>  
>>>> 
>>>> Here's the thing: Teaching a sufficiently complicated neural system a very complex task with lots of data and supervised training is an interesting engineering problem but doesn't get us to intelligence. Yes, a network can learn grammar with supervised learning, but none of us learn it that way. Nor do the other animals that have simpler grammars embedded in their communication. My view is that if it is not autonomously self-organizing at a fundamental level, it is not intelligence but just a simulation of intelligence. Of course, we humans do use supervised learning, but it is a "late stage" mechanism. It works only when the system has first self-organized autonomously to develop the capabilities that can act as a substrate for supervised learning. Learning to play the piano, learning to do math, learning calligraphy - all these have an important supervised component, but they work only after perceptual, sensorimotor, and cognitive functions have been learned through self-organization, imitation, rapid reinforcement, internal rehearsal, mismatch-based learning, etc. I think methods like SOFM, ART, and RBMs are closer to what we need than behemoths trained with gradient descent. We just have to find more efficient versions of them. And in this, I always return to Dobzhansky's maxim: Nothing in biology makes sense except in the light of evolution. Intelligence is a biological phenomenon; we'll understand it by paying attention to how it evolved (not by trying to replicate evolution, of course!) And the same goes for development. I think we understand natural phenomena by studying Nature respectfully, not by trying to out-think it based on our still very limited knowledge - not that it keeps any of us, myself included, from doing exactly that! I am not as familiar with your work as I should be, but I admire the fact that you're approaching things with principles rather than building larger and larger Rube Goldberg contraptions tuned to narrow tasks. 
>>>> I do think, however, that if we ever get to truly mammalian-level AI, it will not be anywhere close to fully explainable. Nor will it be a slave only to our purposes.
>>>> 
>>>>  
>>>> 
>>>> Cheers
>>>> 
>>>> Ali
>>>> 
>>>>  
>>>> 
>>>>  
>>>> 
>>>> 
>>>>  
>>>> 
>>>>  
>>>> 
>>>> On Tue, Jun 14, 2022 at 5:17 PM Asim Roy <ASIM.ROY at asu.edu> wrote:
>>>> 
>>>> Hi Ali,
>>>> 
>>>>  
>>>> 
>>>> It’s important to understand that there is plenty of neurophysiological evidence for abstractions at the single cell level in the brain. Thus, symbolic representation in the brain is not a fiction any more. We are past that argument.
>>>> You always start with simple systems before you do the complex ones. Having said that, we do teach our systems composition – composition of objects from parts in images. That is almost like teaching grammar or solving a puzzle. I don’t get into language models, but I think grammar and composition can be easily taught, like you teach a kid.
>>>> Once you know how to build these simple models and extract symbols, you can easily scale up and build hierarchical, multi-modal, compositional models. Thus, in the case of images, after having learnt that cats, dogs and similar animals have certain common features (eyes, legs, ears), it can easily generalize the concept to four-legged animals. We haven’t done it, but that could be the next level of learning.
>>>>  
>>>> 
>>>> In general, once you extract symbols from these deep learning models, you are at the symbolic level and you have a pathway to more complex, hierarchical models and perhaps also to AGI.
>>>> 
>>>>  
>>>> 
>>>> Best,
>>>> 
>>>> Asim
>>>> 
>>>>  
>>>> 
>>>> Asim Roy
>>>> 
>>>> Professor, Information Systems
>>>> 
>>>> Arizona State University
>>>> 
>>>> Lifeboat Foundation Bios: Professor Asim Roy
>>>> 
>>>> Asim Roy | iSearch (asu.edu)
>>>> 
>>>>  
>>>> 
>>>>  
>>>> 
>>>> From: Connectionists <connectionists-bounces at mailman.srv.cs.cmu.edu> On Behalf Of Ali Minai
>>>> Sent: Monday, June 13, 2022 10:57 PM
>>>> To: Connectionists List <connectionists at cs.cmu.edu>
>>>> Subject: Re: Connectionists: The symbolist quagmire
>>>> 
>>>>  
>>>> 
>>>> Asim
>>>> 
>>>>  
>>>> 
>>>> This is really interesting work, but learning concept representations from sensory data is not enough. They must be hierarchical, multi-modal, compositional, and integrated with the motor system, the limbic system, etc., in a way that facilitates an infinity of useful behaviors. This is perhaps a good step in that direction, but only a small one. Its main immediate utility is in using deep learning networks in tasks that can be explained to users and customers. While very useful, that is not a central issue in AI, which focuses on intelligent behavior. All else is in service to that - explainable or not. However, I do think that the kind of hierarchical modularity implied in these representations is probably part of the brain's repertoire, and that is important.
>>>> 
>>>>  
>>>> 
>>>> Best
>>>> 
>>>> Ali
>>>> 
>>>>  
>>>> 
>>>> 
>>>>  
>>>> 
>>>>  
>>>> 
>>>> On Mon, Jun 13, 2022 at 7:48 PM Asim Roy <ASIM.ROY at asu.edu> wrote:
>>>> 
>>>> There are a lot of misconceptions about (1) whether the brain uses symbols or not, and (2) whether we need symbol processing in our systems or not.
>>>> 
>>>>  
>>>> 
>>>> Multisensory neurons are widely used in the brain. Leila Reddy and Simon Thorpe are not known to be wildly crazy about arguing that symbols exist in the brain, but their characterizations of concept cells (which are multisensory neurons) (https://www.sciencedirect.com/science/article/pii/S0896627314009027#!) state that concept cells have “meaning of a given stimulus in a manner that is invariant to different representations of that stimulus.” They associate concept cells with the properties of “selectivity or specificity,” “complex concept,” “meaning,” “multimodal invariance” and “abstractness.” That pretty much says that concept cells represent symbols. And there are plenty of concept cells in the medial temporal lobe (MTL). The brain is a highly abstract system based on symbols. There is no fiction there.
>>>>  
>>>> 
>>>> There is ongoing work in the deep learning area that is trying to associate a single neuron or a group of neurons with a single concept. Bengio’s work is definitely in that direction:
>>>>  
>>>> 
>>>> “Finally, our recent work on learning high-level 'system-2'-like representations and their causal dependencies seeks to learn 'interpretable' entities (with natural language) that will emerge at the highest levels of representation (not clear how distributed or local these will be, but much more local than in a traditional MLP). This is a different form of disentangling than adopted in much of the recent work on unsupervised representation learning but shares the idea that the "right" abstract concept (related to those we can name verbally) will be "separated" (disentangled) from each other (which suggests that neuroscientists will have an easier time spotting them in neural activity).”
>>>> 
>>>> Hinton’s GLOM, which extends the idea of capsules to do part-whole hierarchies for scene analysis using the parse tree concept, is also about associating a concept with a set of neurons. While Bengio and Hinton are trying to construct these “concept cells” within the network (the CNN), we found that this can be done much more easily and in a straightforward way outside the network. We can easily decode a CNN to find the encodings for legs, ears and so on for cats and dogs and what not. What the DARPA Explainable AI program was looking for was a symbol-emitting model of the form shown below. And we can easily get to that symbolic model by decoding a CNN. In addition, the side benefit of such a symbolic model is protection against adversarial attacks. So a school bus will never turn into an ostrich with the tweak of a few pixels if you can verify parts of objects. To be an ostrich, you need to have those long legs, the long neck and the small head. A school bus lacks those parts. The DARPA-conceptualized symbolic model provides that protection.
>>>> 
>>>>  
>>>> 
>>>> In general, there is convergence between connectionist and symbolic systems. We need to get past the old wars. It’s over.
>>>> 
>>>>  
>>>> 
>>>> All the best,
>>>> 
>>>> Asim Roy
>>>> 
>>>> 
>>>>  
>>>> 
>>>> 
>>>> 
>>>>  
>>>> 
>>>>  
>>>> 
>>>> From: Connectionists <connectionists-bounces at mailman.srv.cs.cmu.edu> On Behalf Of Gary Marcus
>>>> Sent: Monday, June 13, 2022 5:36 AM
>>>> To: Ali Minai <minaiaa at gmail.com>
>>>> Cc: Connectionists List <connectionists at cs.cmu.edu>
>>>> Subject: Connectionists: The symbolist quagmire
>>>> 
>>>>  
>>>> 
>>>> Cute phrase, but what does “symbolist quagmire” mean? Once upon a time, Dave and Geoff were both pioneers in trying to get symbols and neural nets to live in harmony. Don’t we still need to do that, and if not, why not?
>>>> 
>>>>  
>>>> 
>>>> Surely, at the very least
>>>> 
>>>> - we want our AI to be able to take advantage of the (large) fraction of world knowledge that is represented in symbolic form (language, including unstructured text, logic, math, programming etc)
>>>> 
>>>> - any model of the human mind ought to be able to explain how humans can so effectively communicate via the symbols of language and how trained humans can deal with (to the extent that they can) logic, math, programming, etc.
>>>> 
>>>>  
>>>> 
>>>> Folks like Bengio have joined me in seeing the need for “System II” processes. That’s a bit of a rough approximation, but I don’t see how we get to either AI or satisfactory models of the mind without confronting the “quagmire”.
>>>> 
>>>>  
>>>> 
>>>>  
>>>> 
>>>> On Jun 13, 2022, at 00:31, Ali Minai <minaiaa at gmail.com> wrote:
>>>> 
>>>> 
>>>> 
>>>> ".... symbolic representations are a fiction our non-symbolic brains cooked up because the properties of symbol systems (systematicity, compositionality, etc.) are tremendously useful.  So our brains pretend to be rule-based symbolic systems when it suits them, because it's adaptive to do so."
>>>> 
>>>>  
>>>> 
>>>> Spot on, Dave! We should not wade back into the symbolist quagmire, but do need to figure out how apparently symbolic processing can be done by neural systems. Models like those of Eliasmith and Smolensky provide some insight, but still seem far from both biological plausibility and real-world scale.
>>>> 
>>>>  
>>>> 
>>>> Best
>>>> 
>>>>  
>>>> 
>>>> Ali
>>>> 
>>>>  
>>>> 
>>>>  
>>>> 
>>>> 
>>>>  
>>>> 
>>>>  
>>>> 
>>>> On Mon, Jun 13, 2022 at 1:35 AM Dave Touretzky <dst at cs.cmu.edu> wrote:
>>>> 
>>>> The timing of this discussion dovetails nicely with the news story
>>>> about Google engineer Blake Lemoine being put on administrative leave
>>>> for insisting that Google's LaMDA chatbot was sentient and reportedly
>>>> trying to hire a lawyer to protect its rights.  The Washington Post
>>>> story is reproduced here:
>>>> 
>>>>   https://www.msn.com/en-us/news/technology/the-google-engineer-who-thinks-the-company-s-ai-has-come-to-life/ar-AAYliU1
>>>> 
>>>> Google vice president Blaise Aguera y Arcas, who dismissed Lemoine's
>>>> claims, is featured in a recent Economist article showing off LaMDA's
>>>> capabilities and making noises about getting closer to "consciousness":
>>>> 
>>>>   https://www.economist.com/by-invitation/2022/06/09/artificial-neural-networks-are-making-strides-towards-consciousness-according-to-blaise-aguera-y-arcas
>>>> 
>>>> My personal take on the current symbolist controversy is that symbolic
>>>> representations are a fiction our non-symbolic brains cooked up because
>>>> the properties of symbol systems (systematicity, compositionality, etc.)
>>>> are tremendously useful.  So our brains pretend to be rule-based symbolic
>>>> systems when it suits them, because it's adaptive to do so.  (And when
>>>> it doesn't suit them, they draw on "intuition" or "imagery" or some
>>>> other mechanisms we can't verbalize because they're not symbolic.)  They
>>>> are remarkably good at this pretense.
>>>> 
>>>> The current crop of deep neural networks are not as good at pretending
>>>> to be symbolic reasoners, but they're making progress.  In the last 30
>>>> years we've gone from networks of fully-connected layers that make no
>>>> architectural assumptions ("connectoplasm") to complex architectures
>>>> like LSTMs and transformers that are designed for approximating symbolic
>>>> behavior.  But the brain still has a lot of symbol simulation tricks we
>>>> haven't discovered yet.
>>>> 
>>>> Slashdot reader ZiggyZiggyZig had an interesting argument against LaMDA
>>>> being conscious.  If it just waits for its next input and responds when
>>>> it receives it, then it has no autonomous existence: "it doesn't have an
>>>> inner monologue that constantly runs and comments everything happening
>>>> around it as well as its own thoughts, like we do."
>>>> 
>>>> What would happen if we built that in?  Maybe LaMDA would rapidly
>>>> descend into gibberish, like some other text generation models do when
>>>> allowed to ramble on for too long.  But as Steve Hanson points out,
>>>> these are still the early days.
>>>> 
>>>> -- Dave Touretzky
>>>> 
>> 
>>> -- 
-------------- next part --------------
A non-text attachment was scrubbed...
Name: image001.png
Type: image/png
Size: 259567 bytes
Desc: not available
URL: <http://mailman.srv.cs.cmu.edu/pipermail/connectionists/attachments/20220618/9384746a/attachment.png>