Paper on Loops in AI and Consciousness
PREPRINT DRAFT
Title: A Characterisation of Processing Loops in AI and Biological Systems and its Implications for Understanding Consciousness
Abstract
The claim is made that cycles of non-physical “mental” actions are required in agents that operate within sufficiently complex environments, and that such actions require regulation through the use of a model. A framework is proposed, named the Visceral Loop, that identifies three distinct kinds of processing within such a system in terms of how it uses that model. It is shown that this can be used to characterise human thought, including about thought itself, and how it is involved when an individual concludes that they are conscious. A proof is given relating an upper bound on data available within access consciousness to the Visceral Loop characterisations of thought.
Introduction
Any computational system is limited in the complexity that it can handle within a single computational step. For embodied agents, this appears as a limit on the environmental complexity that they can sufficiently model and respond to within a single observe-infer-act cycle. For more complex problems, multiple steps of processing are required in order to determine the next physical action. Such multiple processing steps may entail, for example, further analysis of the environment in order to better model its state; or it may entail action planning over multiple iterations.
In biology, this provides scope for evolutionary pressures to trade off between a more energy-hungry, complex brain and a simpler, less energy-intensive one that takes longer to make some decisions.
An agent that regulates its environment operates within a system containing environment state S_env, which changes with some ambient dynamics D_env(t). The agent performs action A_env against the environment in order to regulate it towards some target. The environment state outcome O_env is influenced by both D_env(t) and A_env. This can be summarised as follows:

S_env + D_env(t) + A_env = O_env
According to the good regulator theorem, if the agent is to regulate the environment state it must be a “model of the system” (Conant & Ashby, 1970). Furthermore, we can say that the efficiency of the agent to regulate its environment depends on its accuracy in modelling the system. Errors in the accuracy of the model result in errors in the regulation of the system. In learning agents, those errors can be used for subsequent training of the model.
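As an illustrative sketch only, the regulation loop described above could be realised as follows. It assumes a scalar environment state, a constant drift standing in for D_env(t), and a trivially simple learned model; all names and numbers are placeholders rather than anything prescribed by this paper.

```python
# Minimal sketch of a regulation loop with a learned model of the system.
# Illustrative only: names (EnvModel, drift, regulate) are not from the paper.

class EnvModel:
    """The agent's internal model of the ambient dynamics D_env(t)."""
    def __init__(self):
        self.drift_estimate = 0.0  # learned estimate of D_env(t)

    def predict(self, state, action):
        # Predicted outcome: S_env + estimated drift + A_env
        return state + self.drift_estimate + action

    def update(self, prediction_error, lr=0.1):
        # Regulation/prediction errors are reused to train the model.
        self.drift_estimate += lr * prediction_error


def regulate(state, target, drift, model, steps=50):
    """Observe-infer-act loop: choose A_env so the modelled outcome hits the target."""
    for t in range(steps):
        action = target - (state + model.drift_estimate)   # infer A_env from the model
        outcome = state + drift(t) + action                 # S_env + D_env(t) + A_env = O_env
        model.update(outcome - model.predict(state, action))
        state = outcome
    return state


final = regulate(state=0.0, target=10.0, drift=lambda t: 0.5, model=EnvModel())
print(f"final state: {final:.2f}")  # converges near the target as the model improves
```

In this toy loop the model’s prediction error is exactly the regulation error, so the sketch also illustrates the point above that model inaccuracy translates directly into regulation inaccuracy, and that the error signal can be reused for training.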
An embodied agent with complex actions requires an additional level of regulation. Not only must it regulate its external environment, it must also regulate its own physical state. This includes both maintaining homeostasis and controlling action for efficiency and effectiveness. Such an agent thus operates in a system that additionally has body state S_body with ambient dynamics D_body(t). The agent performs action A_body against its body, producing outcome O_body, summarised as follows:

S_body + D_body(t) + A_body = O_body
The agent’s body actions are performed in order to regulate it towards some target, which is dynamically inferred based on its requirement for body homeostasis and for environment action A_env. In learning agents, the inference of the required action at any given moment is based upon some sort of model. For example, Psychology and Neuroscience refer to a body schema (Proske & Gandevia, 2012) in humans as an explicit representational model.
An agent that incorporates multi-step processing has a third kind of action: one that changes its internal data state without affecting its physical state. Importantly, this system also requires regulation, for the same reasons as the environment and body, but such non-physical actions may not elicit any change to S_body or S_env. Thus the agent must regulate its non-physical state S_mind, having ambient dynamics D_mind(t). In order to do so it performs action A_mind, producing outcome O_mind, summarised as follows:

S_mind + D_mind(t) + A_mind = O_mind
The agent’s non-physical actions are performed in order to regulate towards some target, which is dynamically inferred based on its requirement for environment action A_env, body action A_body, and possibly for some form of non-physical homeostasis. As with environment and body regulation, in order for the agent to efficiently regulate its mind state, it must model its own behaviour. This suggests that it must incorporate a functional equivalent of the body schema, which this paper refers to as the mind schema. Other research has drawn similar conclusions (Graziano, 2017).
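A toy sketch of such mind regulation is given below. It assumes that the non-physical action A_mind is “one more refinement step” on an internal estimate, and that the mind schema is simply a learned estimate of how much each additional step helps. The structure, names and thresholds are hypothetical illustrations, not mechanisms claimed by this paper.

```python
# Toy illustration: regulating non-physical actions using a "mind schema".
# Hypothetical sketch only; nothing here is prescribed by the framework.

class MindSchema:
    """Model of the agent's own processing: expected gain from one more step."""
    def __init__(self):
        self.expected_gain = 1.0

    def update(self, observed_gain, lr=0.3):
        self.expected_gain += lr * (observed_gain - self.expected_gain)


def think_then_act(observation, schema, min_gain=0.05):
    estimate, step = observation, 0
    while schema.expected_gain > min_gain:      # regulate S_mind: keep thinking only while it pays off
        refined = (estimate + observation) / 2 + 1.0 / (step + 2)  # A_mind: one refinement step
        schema.update(abs(refined - estimate))  # train the mind schema on the observed gain
        estimate, step = refined, step + 1
    return estimate, step                       # now commit to a physical action


value, steps_used = think_then_act(observation=3.0, schema=MindSchema())
print(f"acted after {steps_used} internal steps, estimate={value:.2f}")
```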
By way of example of the importance of such mind regulation, consider the case of fluent aphasia, caused by damage to Wernicke’s area of the brain. Individuals with fluent aphasia can easily produce speech, but it is typically full of meaningless words and is often unnecessarily long-winded. Wernicke’s area is associated with language comprehension and, as such, provides a corrective mechanism during speech production in a neurotypical individual (Wernicke’s area).
Visceral Loop
This paper introduces the concept of a Visceral Loop as a characterisation of processing within a looping biological or AI agent of the sort described in the introduction. The Visceral Loop is so named because it refers to an agent concluding that it experiences consciousness “in a visceral way”. It identifies that an agent with sufficient representational capabilities can conclude that it is conscious in as few as three iterations of the loop. Each of those iterations has specific characteristics, and the Visceral Loop characterises thought as falling into one of those three iterations.
Let:
- E be the agent’s set of beliefs about the external world
- B be the agent’s set of beliefs about its own physical body (drawn from the body schema) and of bodies in general, and if it has a concept of its identity then this set includes a belief that relates other body beliefs to its identity
- M be the agent’s set of beliefs about its own mind (drawn from the mind schema) and of minds in general, and if it has a concept of its identity then this set includes a belief that relates other mind beliefs to its identity
- f(..) be the function executed by the agent on the specified inputs in order to draw inferences

M can be thought of as an agent’s “theory of mind”, because it relates not only to itself but also to its ability to predict the hidden mental state of others.
Iteration 1:
Iteration 1 represents the most common kind of data processing, such as spending multiple processing cycles to refine the identification of something within the visual field. While an agent’s mind schema may be used to regulate the thought process, the result of Iteration 1 never makes any reference to it.
Let x be an inference produced as the result of a processing step, such that it does not draw any reference to M (i.e. x ∉ M, and if x is a relation then x = relation(α,β) such that α ∉ M and β ∉ M and α ⊄ M and β ⊄ M). Given some sense input or past state s, a processing step is characterised as Visceral Loop Iteration 1 if it is of the following form:

f(s, E ∪ B ∪ M) -> x
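The Iteration 1 condition can be illustrated with a small sketch, assuming a deliberately naive representation in which beliefs are plain strings and an inference is either a single belief or a relation(α, β) between two sets of beliefs. The representation is purely illustrative; the framework itself does not prescribe one.

```python
# Naive sketch of the Iteration 1 condition. Illustrative representation only.

from dataclasses import dataclass

E = {"red", "blob", "tree", "apple"}      # beliefs about the external world
B = {"hand", "eye", "my-body"}            # beliefs about the body (body schema)
M = {"thought", "inference", "my-mind"}   # beliefs about minds (mind schema)


@dataclass(frozen=True)
class Relation:
    alpha: frozenset   # first relatum
    beta: frozenset    # second relatum


def references_M(x) -> bool:
    """True if inference x draws any reference to M, directly or via its relata."""
    if isinstance(x, Relation):
        return bool((x.alpha | x.beta) & M)
    return x in M


def is_iteration_1(x) -> bool:
    # f(s, E ∪ B ∪ M) -> x is Iteration 1 when x makes no reference to M.
    return not references_M(x)


# "What's that red blob in the tree? Oh, it's an apple" -> a relation drawn only over E.
x1 = Relation(frozenset({"red", "blob", "tree"}), frozenset({"apple"}))
print(is_iteration_1(x1))   # True
```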
Iteration 2:
Iteration 2 processing steps draw conclusions that relate past non-physical actions and inferences to the agent’s theory of mind and to the agent’s concept of its identity. For example: concluding that a past data state or non-physical action can be classified as “thought”; concluding whether the primary source of a past data state was external or internal; or relating the fact of an internal source to the agent’s concept of its identity.
Iteration 2 requires an agent to have sufficient representational capabilities to produce inferences that represent relations involving M. Given some prior inference y, a processing step is characterised as Visceral Loop Iteration 2 if it is of the following form, the relation with respect to M is non-empty, and it cannot be characterised as Iteration 3:

f(y, E ∪ B ∪ M) -> relation(y, M)
Iteration 3:
Iteration 3 is a special case of what would otherwise be Iteration 2, but it implies stricter requirements on the agent’s introspective and representational capabilities. Iteration 3 covers the ability of the agent to develop a summary of its own mental capabilities (i.e. some subset m ⊂ M), and to consider that summary in relation to its conception of mental capabilities in general or to its identity (i.e. M). Iteration 3 is involved in an agent concluding itself to be conscious, as will be seen in the section below.

Given some prior inference relation(z, M), and some subset of beliefs m ⊂ M, a processing step is characterised as Visceral Loop Iteration 3 if it is of the following form and the relation with respect to M is non-empty:

f(relation(z, M), E ∪ B ∪ M) -> relation(m, M)
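Continuing the naive representation used in the Iteration 1 sketch above, the three characterisations can be loosely mirrored by a single classification rule: no reference to M gives Iteration 1, a relation to M gives Iteration 2, and a relation from some summary m ⊂ M back to M gives Iteration 3. Again, the data structures and example inferences are hypothetical.

```python
# Loose, illustrative mirror of the three Visceral Loop characterisations.

from dataclasses import dataclass

M = {"thought", "my-mind", "consciousness-in-general"}   # mind schema / theory of mind


@dataclass(frozen=True)
class Relation:
    alpha: frozenset
    beta: frozenset


def classify(x) -> int:
    """Return the Visceral Loop iteration (1, 2 or 3) characterising inference x."""
    relata = x.alpha | x.beta if isinstance(x, Relation) else frozenset({x})
    if not relata & M:
        return 1                  # no reference to M: Iteration 1
    if isinstance(x, Relation) and x.alpha and x.alpha < M and x.beta & M:
        return 3                  # some summary m ⊂ M related back to M: Iteration 3
    return 2                      # a prior inference related to M: Iteration 2


# x_2: a prior Iteration 1 result classified as a "thought"
x2 = Relation(frozenset({"apple-inference"}), frozenset({"thought"}))
# x_5: a belief about consciousness in general (m ⊂ M) related to the agent's own mind
x5 = Relation(frozenset({"consciousness-in-general"}), frozenset({"my-mind"}))
print(classify(x2), classify(x5))   # 2 3
```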
Consciousness
The concept of the Visceral Loop provides a framework for classifying the capabilities of different processing systems. It also has important implications for understanding consciousness, particularly in its access consciousness interpretation (Block, 1995).
Two examples of the descriptive power of the Visceral Loop in relation to human consciousness are presented here.
Example 1
In this first example, the Visceral Loop is applied to understand the thought processes whereby an individual concludes themselves as conscious. Consider the following sequence of internal mental observations:
- “What’s that red blob in the tree? Oh, it’s an apple”.
- “Oh, those thoughts just came from my mind, and not from the outside world”.
- “That’s what consciousness is. I am conscious”.
The first observation is a straightforward example of Iteration 1 that does not make any reference to the agent’s theory of mind (of their own mind or of others). The concepts of “red”, “blob”, “tree” and “apple” are all contained within the set E, and thus the inference in relation to the visual field sense input s is of the form x_1 = relation(s, E).
The second observation contains two examples of Iteration 2 inferences. In the first, the individual’s processing capabilities have selected attentional focus upon the prior Iteration 1 inference, and have drawn a subsequent inference about it as being data that can be classified as a “thought”. As beliefs about “thought” are contained within M, this is an inference of the form x_2 = relation(x_1, M). In the second, the individual draws a subsequent inference about the source of the Iteration 1 inference as being their own mind. The individual’s ability to classify inferences in relation to themselves also depends upon M, and the inference is of the form x_3 = relation(x_1, M).
The third observation draws upon the individual having an a priori conception of consciousness in general, denoted by m_c ⊂ M. The individual compares their prior Iteration 2 inferences x_2 and x_3 to m_c, and produces an inference that x_2 and x_3 together satisfy the requirements for consciousness. This is another Iteration 2 inference, of the form x_4 = relation(x_2 & x_3, m_c). Finally, the individual relates m_c, the belief about consciousness in general, to themselves, which again depends on M. That final inference is thus an Iteration 3 inference, of the form x_5 = relation(m_c, M).
Example 2
As a second example of the descriptive power of the Visceral Loop, a theorem is presented here about the nature of human consciousness.
First, an axiomatic baseline must be established. The author is unable to think of any rational way in which they may consciously experience something and yet be unable to subsequently think about that experience and to know that they are thinking about that experience. Thus, it would seem that being able to knowingly think about our conscious experiences is a fundamental component of consciousness. The following claims are derived from this statement without further proof:
Claim 1:
- All conscious experience is subsequently available for further thought.
Claim 2:
- For all thought about conscious experience, the individual can identify that thought as being their own.
Note that these claims do not assume that all conscious experience is actually thought about; only that it is in principle available for such thought. Additionally, no assumption is made about whether other kinds of thought are consciously experienced or not.
Theorem 1:
- The content of conscious experience is upper bounded by the data about which Visceral Loop Iteration 2 inferences can be produced.
Proof:
- The content of conscious experience here refers to the set of data represented and/or processed within the brain which is consciously experienced by that individual, in distinction to other data represented and/or processed in the brain which is not consciously experienced.
- Thought is a computational process, and thus is a series of inferences.
- As per Claim 1, all of conscious experience must be available for producing subsequent inferences about those conscious experiences.
- As per Claim 2, the individual must be able to identify that they produced those inferences.
- In order for an individual to identify an inference as being their own, they must have some beliefs about their inference capabilities and how they relate to themselves as an individual entity. This is included in the set M, which Iteration 2 produces inferences in relation to, and which is not directly accessible for inferences within Iteration 1.
- Imagine some supposed experience, and an inference i produced about that experience. Additionally imagine that an Iteration 2 inference cannot be produced about i, for example due to some incompatibility of structure, a lack of a data path to Iteration 2 processing capabilities, or an inherent limitation in Iteration 2 processing capabilities. The inference i cannot be identified in relation to the individual. As such, the supposed experience fails Claim 2, and i must in actual fact be an inference about some sort of non-conscious experience.
- Thus, an experience is not a conscious experience if it can only lead to inferences which cannot be included in an Iteration 2 inference.
Summary and Conclusions
The claim has been made that sufficiently complex environments and agent bodies require a trade-off between the inferential computing power of a single processing step and the use of multi-step processing. In order to be a “good regulator” of its own non-physical actions, the agent must model its non-physical behaviours. Various forms of such modelling are possible, and the characterisations offered by the Visceral Loop provide an insight into what kinds of self-referential thought are possible depending on the kind of model in use.
In deep reinforcement learning, agents are classified as being model-free or model-based (Lazaridis, Fachantidis, & Vlahavas, 2020). A model-free agent, which implicitly is a model but which does not contain an explicit representational model, may never exceed Iteration 1 processing, as there is no explicit model for it to introspect. A typical model-based RL agent of the sort used in most research today has an explicit model, but its own inferential capabilities would not directly introspect that model. However, it should not require much effort to enable a model-based RL agent to access its model and to draw inferences from it.
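A hedged sketch of that last point is given below: a toy model-based agent with a tabular transition model, extended with an “introspect” query so that its inference function can take its own model as input rather than only inferring through it. None of this refers to a specific RL library or algorithm.

```python
# Toy model-based agent that can also draw inferences about its own model.
# Hypothetical sketch; not a claim about any particular RL framework.

from collections import defaultdict

class ModelBasedAgent:
    def __init__(self):
        # learned transition model: (state, action) -> predicted next state
        self.model = defaultdict(lambda: None)

    def learn(self, state, action, next_state):
        self.model[(state, action)] = next_state

    def plan(self, state, actions):
        """Ordinary model use: pick the action whose predicted outcome is best (largest here)."""
        scored = [(self.model[(state, a)], a) for a in actions if self.model[(state, a)] is not None]
        return max(scored)[1] if scored else None

    def introspect(self, state, action):
        """Inference drawn *about* the model itself, rather than through it."""
        prediction = self.model[(state, action)]
        if prediction is None:
            return "my model has no belief about this transition"
        return f"my model predicts {prediction}; that prediction is mine, not an observation"


agent = ModelBasedAgent()
agent.learn(state=0, action="right", next_state=1)
print(agent.plan(0, ["left", "right"]))   # 'right'
print(agent.introspect(0, "right"))
```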
The Visceral Loop has been shown to offer significant insight into consciousness. More work is possible here. It should be noted that there are aspects of consciousness that the Visceral Loop makes no claim over. In particular, it offers no explanation for the so-called “hard problem” of phenomenal consciousness (Chalmers, 1995), the “what it feels like” aspect of consciousness. However, the author believes that stronger claims about the nature of both access consciousness and phenomenal consciousness can yet be made, based on the framework of the Visceral Loop.
The Visceral Loop can be used to understand other aspects of thought. For example, it explains why fMRI studies have suggested that we become aware of a decision after it is made (Soon, Brass, Heinze & Haynes, 2008): it takes extra processing cycles to consciously consider the fact of the decision having been made. In short, we can only think about one thing at a time, so the decision itself and thought about the decision require separate steps.
tbd:
- Tie VL to need for computational models, as discussed by (Michael D. Colagrosso, Michael C. Mozer. 2004. Theories Of Access Consciousness. NIPS.)
- elaborate further on ramifications of VL.
- Look for “Libet et al., 1983” as another example of becoming aware of something after the fact: “S is aware of something (P-consciousness) he mistakenly believes caused his response when, in fact, the latter was triggered by cerebral events which occurred prior to its phenomenal representation (see Libet et al. 1983).”
The work presented here has implications for understanding consciousness, for the design of AI systems, and perhaps forms one of the necessary building blocks towards artificial general intelligence (AGI).
References
Block, N. (1995). On a confusion about a function of consciousness. Behavioral and Brain Sciences, 18(2), 227–247. https://doi.org/10.1017/S0140525X00038188
Chalmers, D. J. (1995). Facing up to the problem of consciousness. Journal of Consciousness Studies, 2(3), 200–219. http://dx.doi.org/10.1093/acprof:oso/9780195311105.003.0001
Conant, R. C., & Ashby, W. R. (1970). Every good regulator of a system must be a model of that system. International Journal of Systems Science, 1(2), 89–97. https://doi.org/10.1080/00207727008920220
Graziano, M. S. A. (2017). The Attention Schema Theory: A Foundation for Engineering Artificial Consciousness. Front. Robot. AI. https://doi.org/10.3389/frobt.2017.00060
Lazaridis, A., Fachantidis, A., & Vlahavas, I. (2020). Deep Reinforcement Learning: A State-of-the-Art Walkthrough. Journal of Artificial Intelligence Research, 69, 1421–1471. https://doi.org/10.1613/jair.1.12412
Proske, U., & Gandevia, S. C. (2012). The Proprioceptive Senses: Their Roles in Signaling Body Shape, Body Position and Movement, and Muscle Force. Physiological Reviews, 92(4), 1651–1697. https://doi.org/10.1152/physrev.00048.2011
Soon, C. S., Brass, M., Heinze, H. J., & Haynes, J. D. (2008). Unconscious determinants of free decisions in the human brain. Nature Neuroscience, 11(5), 543–545. https://doi.org/10.1038/nn.2112
Wernicke’s area. (n.d.). In Wikipedia. https://en.wikipedia.org/wiki/Wernicke%27s_area