Featured Video Play Icon

Guidance of Visual Attention

Denis Buehler (Universidad Nacional Autónoma de México)

[Jump to comments]


Abstract: I offer a novel way of addressing Harry Frankfurt’s ‘problem of action’: From reflection on empirical psychological research, I develop an empirical explication of the notion guidance by the individual. I speculate on how this explication may contribute to understanding the nature of agency. [1]

1 Understanding Agency[2]

The deepest challenge in action theory, says Harry Frankfurt, is to explain the distinction between processes that an individual guides and processes that occur without an individual’s guidance.[3] We have to understand the nature of guidance in order to understand the nature of agency.

A macaque searches some bushes for raspberries. He actively guides his visual attention from one location to the next until he finds the berries. When a bright object abruptly appears in the periphery of the macaque’s field of view, the object captures the macaque’s attention. His attention then shifts passively. A jumping spider walks across the jungle floor toward her prey. The spider actively guides the movements of her legs in so walking. When the spider’s legs contract in a shreck-reaction upon getting in contact with a toxic substance, the spider does not guide her legs’ movements. Often when you solve a chess problem you systematically consider different possible solutions. You are then actively guiding your reasoning toward a solution of the problem. You do not guide your reasoning when the solution ‘pops’ into your mind.

How can we understand this distinction? As I will explain later, all extant accounts of agency fail to answer this question in a satisfying way. This fact motivates the search for a novel account. It also motivates the application of a new methodology in action theory. In this paper I attempt both.

Much of this paper is devoted to developing an empirical explication of individuals’ guidance by investigating actual primates’ guidance of visual attention. I will argue for this explication from a case study of such attention shifts’ psychology. The explication’s basic idea is that some specific type of psychological process realizes these individuals’ guidance. I show that

Guidance of Visual Attention

Central executive control realizes primates’ guidance of their visual attention shifts.

Empirical psychology provides extensive evidence that actual primates have a central executive system that coordinates and integrates psychological states and processes from different modalities and other psychological subsystems. Central executive control is the control that the central executive system exercises over a process. My appeal to central executive system meets two requirements on an explication of individuals’ guidance. First, central executive control correlates with those individuals’ guidance. But second, appeals to central executive control help deepen our understanding of individuals’ guidance.

An explication of a concept must not be conflated with a reductive account, a definition, or the giving of necessary and sufficient conditions on a nature.[4] An explication of a concept need not even apply to all instances of that concept in the actual world. Many explications identify uses of a concept that centrally guide its applications. Many explications identify salient marks of such uses of a concept. In either way explications can deepen understanding of the concept. Explications often deepen our understanding of a notion by deepening our understanding of the kind that the notion denotes. Such explications often depend on empirical information.[5] I do not claim that my initial explication offers a necessary condition on guidance. I do not claim that it offers a condition on all animals, even those in the actual world. I do not even claim to have shown that all exercises of agency in actual primates are constituted by central executive control. My more modest, initial aim is to offer an empirical explication of individuals’ guidance.

Nevertheless I will suggest that my empirical explication may bear on the constitution of guidance, and hence of agency. A constitutive explication of a concept provides a necessary condition on the kind that the concept denotes, appeals to which condition deepen our understanding of the concept. A constitutive explication contributes to an account of the kind. A cursory overview of guided processes in actual primates makes it plausible that all these processes are subject to central executive control. I suggest that therefore

Guidance in Primates

Central executive control (partly) constitutes primates’ guidance.

            Next, I conjecture that this result may point us in the direction of an account of guidance for creatures beyond primates.

Constitution of Guidance

The control of a process by an individual’s central structures (partly) constitutes that individual’s guidance.

            Central structures function to integrate and coordinate activity in an individual’s sub-systems. These structures need not involve central executive control. They need not be psychological structures. I conjecture that for all agents, when they guide their act, some central structure unifies their different capacities so as to constitute the individual’s guidance of that act. Showing that either proposal specifies a constitutive condition on agency, even just in primates, would not only require investigating other kinds of agency in primates and other animals. It would also require arguing that my explication provides a necessary condition on agency’s nature. Both tasks are beyond the scope of the present paper. Nevertheless I conclude by explaining why I think that the two constitutive explications hold some promise of being true.

My first aim in this paper is to contribute to our understanding of individuals’ guidance. My second aim is to advertise a new methodology for action theory. I develop my explications of guidance from reflection on empirical psychological research. Frankfurt, like many other action theorists, sought a constitutive explication of guidance exclusively by way of armchair reflection. Here I depart from Frankfurt.[6] I doubt that armchair reflection alone will yield a sufficiently full explication of guidance. But, to be sure, I also do not want to suggest that we should merely discard our concepts in favor of scientific notions. I rather hope to integrate philosophical armchair reflection methods with reflection on scientific knowledge. My method will rely on intuitively clear cases of individuals’ guidance, psychology’s description of these processes, and armchair reflection on marks of our concept guidance. I argue that reflection on all three items supports my explication Guidance of Visual Attention.[7] I will have achieved my methodological aim if my reader acknowledges that a serious engagement with empirical research can yield interesting, new philosophical insights.[8]

I begin, in section 2, by briefly explaining how recent accounts of the nature of agency fail to meet plausible constraints on any such account. The paper’s main argument will be presented in section 3. It establishes that central executive control realizes primates’ guidance of visual attention. In section 4 I conclude with reflections on the constitution of agency.[9]

2 Recent Action Theory

When Frankfurt introduces the challenge of explaining guidance, he in effect formulates what I will call the comprehensiveness-constraint and the attributability-constraint on an account of agency.[10] The six most influential accounts of agency fail to meet these constraints.

The contrast between guided and non-guided episodes can be found across species. This observation motivates the


An account of agency should explain all instances of agency (across kinds of agents and species).

An account of agency should not narrowly focus on sophisticated human forms of acting. Much recent action theory fails to meet this constraint.[11] Reflection-based accounts of agency take reflective thought to be constitutive of agency. For an event to be an action, the event must be caused by an intention (or a desire, or a reason) and a higher-order, reflective state that endorses acting from that intention. Thus says Velleman: “action is activity regulated by … reflective understanding.”[12] Responsibility-based accounts contend that an event is an action just in case the individual is answerable for the event (in some specific way). Anscombe, for example, suggests that intentional actions are “actions to which a certain sense of the question ‘Why?’ is given application, [namely the sense] in which the answer, if positive, gives a reason for acting.”[13] Intention-based accounts have it that an event is an action just in case it is caused by individuals’ intention to act. Intentions are propositional states (or thoughts) – possible conclusions of practical reasoning. This approach stems from Donald Davidson, who claims that someone “is the agent of an act if what he does can be described under an aspect that makes it intentional.”[14] For Davidson, an event is intentional just in case a belief and a pro-attitude of the agent caused the event.

These proposals hold no promise to account for agency across species. A lioness acts when she stalks her prey. Birds act when they build their nest. A shark acts when mating. An ant acts when it walks back to its hive. These animals surely do not engage in reflection on their own mental lives. These animals also do not ask why-questions in the way humans do. These animals do not engage in any form of reason-giving. And while some mammals may be capable of forms of practical reasoning and hence may have propositional intentions, it is an empirical question whether birds and fishes have them, and highly unlikely that arthropods do. Even for human agency, the accounts seem to over-intellectualize. It seems implausible that humans guide all their acts by reflecting upon them. Some human agency may not even be guided by propositional intentions. Many human acts are highly over-learned or reflexive. Think of tying your shoelaces, weaving a jab, or walking down the street. Usually individuals do not reflect on their intention to tie their laces. It is unclear whether individuals must propositionally intend to weave the jab, or to walk there in order to perform these actions. Again, these issues are open to empirical inquiry.

These first three accounts of agency promise at best to capture forms of relatively sophisticated human agency. Goal-based accounts hold more promise for meeting the comprehensiveness-constraint. On these accounts, for an event to be an action, it must be caused by a state that represents a goal. This state need not involve reflection. It need not be a propositional intention. Instead, the goal-representing state might be a more primitive, image-like state. Thus Alfred Mele acknowledges that actions “may be partially accounted for by low-level representations of … various trajectories of ‘to-be-performed’ movements.”[15] Mele allows that such representations may be (elements of) action-guiding states. Many animals have perceptual capacities. They should hence be able to guide their acts through an image-like representation of the act’s goal. The same kind of image-like goal-representation may guide non-sophisticated human acts.

All extant such accounts, however, fail to meet the


An account of agency should explain under which conditions the guidance of a process is attributable to the whole individual.

The attempt to find “a place for agents in the explanatory order of the world” motivates this constraint.[16] The constraint has two components. First, accounts of guidance must distinguish, in a principled way, between states and events that occur at the level of the whole individual, and those that occur at the level of an individual’s sub-systems alone.[17] Actions and perceptions are individual-level events and states. Many processes in the circulatory or visual systems, for example, occur at the level of those sub-systems alone. The circulatory system, not the individual, transports nutrients. The visual system, not the individual, computes edge-representations. The same distinction can be made for the case of action. Performing any motor action involves computations over a vast number of perceptual and motor states. Suppose that an individual guides her hand to pick up an apple. Visual information about the apple integrates with proprioceptive information about the position of different parts of arm and hand. The motor system computes a representation of arm and hand’s goal-states. If the individual’s finger was straight, a representation may specify its goal-state as being bent at some angle. If the wrist was relaxed, a representation may specify the degree to which it must be turned.[18] Many of these image-like, goal-representing states occur only at the level of the individual’s motor system. They cannot be attributed to the whole individual. Since they are not the individual’s events or states, they cannot plausibly constitute her guidance of the movement. Goal-based accounts should distinguish sub-individual goal-representing states from individual-level constitutors of guidance.

Second, accounts of guidance must distinguish, in a principled way, between individual-level states and events that merely causally influence a process, and those that constitute the individual’s guidance of an act. An image-like, goal-representing state, a propositional intention, or a reflective, higher-order commitment may each dispose an individual to pick up an apple. An image-like representation of an apple, an intention to eat apples, or a higher-order commitment to eat fruit may each increase the likelihood that the individual should choose an apple as her food. But suppose that causation by her individual-level image-like, goal-representation as of this apple constitutes the individual’s guidance of the action. This latter individual-level state’s causal influence on the action must importantly differ from that of the earlier individual-level states, which provide merely a background condition for the individual’s guidance. In virtue of what does the image-like, goal-representing state constitute the individual’s guidance? Goal-based accounts should distinguish between individual-level states that constitute an individual’s guidance, and those that provide a causal background condition for guidance.[19] An account of agency will be incomplete if it does not meet the attributability-constraint.

Agent-causation accounts postulate that an event is an action just in case the individual herself causes the event to occur. Roderick Chisholm writes: “when a person acts … he causes his own undertaking and … there is no event or set of events constituting a sufficient causal condition for that undertaking.”[20] This kind of account meets the attributability-constraint. The account could plausibly be broadened so as to encompass other animals’ acts, and hence meet the comprehensiveness-constraint. However, agent-causation accounts postulate a primitive form of causation – causation by an agent. Agents are substances. Substances are not events. Natural science recognizes different types of event-causation. Natural science does not acknowledge causation by substances. Agent-causation explications have been widely dismissed for conflicting with a scientific world-view. Moreover, agent-causation accounts do not contribute much to explaining under which conditions an individual guides a process. These accounts rather stipulate that whenever an individual guides an act, that individual primitively causes the action. So the accounts do not address the attributability-constraint in a satisfying way.

Trying-based accounts do not appeal to a primitive form of causation. Tryings are agential events that cause, and are parts of, an action. Trying-based accounts contend that an event is an action just in case the individual’s trying caused the event in the right way. Thus writes Brian O’Shaughnessy: “All physical action involves a willing or bringing about of act-neutral bodily events.”[21] Tryings are plausibly individual-level events. It seems possible to specify tryings in such a way that both humans and other animals undergo such events. So trying-based accounts, too, hold some promise for meeting both the comprehensiveness- and the attributability-constraint. But no trying-theorist has ever specified the nature of a trying. Intuitively, one may think of tryings as conscious efforts. Proponents of trying-based accounts tend to reject this conception of a trying. Indeed, if this were the correct account of a trying, then it would be at best unclear whether all animal agents, such as spiders and fishes, do try to act, whenever they act. And it seems questionable whether all human action, such as effortless active inference, involves a trying. But pending some independent specification, tryings are identified merely as agential events that constitute acts. In this form, trying-based accounts do not satisfyingly explain under which circumstances the guidance of a process can be attributed to an individual.

None of the six accounts of agency meet both the comprehensiveness-constraint and the attributability-constraint in a satisfying way. This fact motivates searching for a new account.

3 Guidance of Visual Attention

In this section, I argue for an empirical explication of individuals’ guidance. Later, I will rely on this explication to motivate a new account of agency. I will then also explain how we can hope to address the comprehensiveness- and the attributability-constraint in a more satisfying manner. But for now, my focus is on establishing that:

Guidance of Visual Attention

Central executive control realizes primates’ guidance of their visual attention shifts.

The central executive system coordinates and integrates psychological states and processes from different modalities and different psychological subsystems.[22] The system’s coordination and integration enables many individual-level processes, such as problem solving, goal-represented bodily action, and attention shifts. The system achieves such coordination and integration by deploying its own central processing and storage resources. Central processes involve the transformation of non-modular and intermodal states. This processing is intermodal insofar as the central executive system serves as an interface for integrating information from different modalities and psychological capacities. The processing is non-modular insofar as it accepts a wide range of inputs.[23]The central executive system controls individual-level processes, most fundamentally, by flexibly allocating processing and storage resources to the respective processes. Three executive functions that characterize the system’s functioning are the competencies to switch mental set, maintain relevant memories, and inhibit prepotent responses and distractor stimuli. Switching consists in the process of abandoning one process or task and initiating another. Switching is an exercise of a competence to organize different, competing activities at a time, and over time. Maintenance is the activation and holding active of relevant memories, and of encoding incoming task-relevant information into working memory. Inhibition of prepotent responses and distractors is the process of suppressing irrelevant processes and stimuli. Inhibition consists not merely in withdrawing central resources from processes and stimuli, but in suppressing their influence on the execution of ongoing tasks. By allocating central processing and storage resources, the central executive system determines which information and psychological processes from different psychological sub-systems contribute to the course and outcome of individual-level psychological processes. The central executive system controls individuals’ processes in this way.

I abstract from specific models of the central executive system. I do not think of it as primarily a central processing unit, some specific computational or neural mechanism, or some specific part of the brain. Psychologists identify central executive functions, as I think of them, on the basis of behavioral experiments involving a variety of tasks. There are many promising attempts to integrate this behavioral research with computational modeling and neuroscience. But it seems too early to me to strongly commit to the success of any of the specific attempts.[24]

As I explained earlier, for central executive control to help explicate individuals’ guidance, it must, first, appropriately correlate with individuals’ guidance. For central executive control to appropriately correlate with individuals’ guidance, such control must be absent from processes that individuals do not guide. And it must correlate with individuals’ exercises of guidance, when a process is guided. But second, appeals to this central executive control must deepen our understanding of individuals’ guidance. The argument in this section has, accordingly two large parts. They correspond to two steps in providing an empirical explication. First, in section 3.1, I survey psychological factors that help shift attention and I suggest that only central executive control appropriately correlates with guided processes. Second, in section 3.2, I reflect on the notion individuals’ guidance, and on how appeals to central executive control promote our understanding of that notion.

3.1 Central Executive Control as Correlating with Individuals’ Guidance of Visual Attention

Remember the macaque. The monkey guides his shifts of attention when he is searching for berries. His attention shifts passively when a salient stimulus like a spider captures his attention. What psychological processes underlie these different ways in which individuals shift visual attention?


3.1.1 Captured Attention

What happens when attention is captured? Psychologists distinguish two psychological systems for orienting attention: the exogenous and the endogenous system.[25] The exogenous system exclusively responds to stimuli that are physically salient. The system orients attention reflexively, rapidly. It is very difficult to voluntarily suppress this system’s activity. Intentions, beliefs, expectations often do not influence its activity. The endogenous system’s activity, on the other hand, is not restricted to any specific type of stimulus. Intentions, beliefs, expectations, and goals typically shape its activity.[26] The exogenous and endogenous systems both generate assignments of priority on a priority map. The priority map is a topographical representation of the visual scene that assigns priority values to objects and locations in the scene. Attention shifts to the location with highest priority.[27]

When attention is captured, it shifts passively. Individuals do not guide such shifts. We can distinguish two ways in which visual attention can be captured. First, visual attention can be captured in absence of individuals’ guidance. Suppose that the macaque passively stares at his surroundings. A suddenly appearing spider will capture his visual attention. But, second, the appearance of the spider may also override individuals’ guidance. The macaque, while guiding his visual attention shifts in his search for berries, may have his attention captured by the spider, in spite of his guidance. The capturing stimulus overrides the monkey’s guidance. Attention shifts during instances of attention capture can be fully explained by appeal to the exogenous system’s activity alone.[28] Endogenous factors do not contribute to these shifts. The exogenous system’s activity can hence not be the primary realizer of individuals’ guidance. Such activity does not appropriately correlate with exercises of individuals’ guidance.


3.1.2 Drawn Attention

Many shifts of attention result from the interaction of the endogenous and exogenous system. Factors that do and factors that do not correlate with individuals’ guidance co-occur in such cases. How can we distinguish these different factors? Shifts during visual search typically involve the interaction of the two systems. In visual search, individuals guide their attention shifts toward the goal of their search. When a stimulus captures attention during visual search, the exogenous system overrides individuals’ guidance. When the exogenous system influences, but does not fully override or interrupt endogenously driven shifts, I will say that attention is drawn by a stimulus.

Suppose that the subject in an experiment searches for a green diamond. It so happens that the green diamond is also the only green item in the display. So the green diamond is the physically salient stimulus in the display. Folk et al. suggest that the individual’s goal to find the green diamond may tune the exogenous system to be more sensitive, and react more rapidly to, green items.[29] The increased sensitivity to the salient item – which is also the target of the search – enhances the individual’s search. Individuals become faster and more accurate at shifting attention to the diamond.[30] This increased sensitivity is due to the individual’s goal to find the diamond. Despite the contribution to these shifts by the exogenous system, we recognize the individual as guiding these shifts. But also, clearly, the contribution of the exogenous system to these shifts is not the factor that appropriately correlates with the individuals’ guidance. The salient stimulus may equally draw attention away from the target’s location. The stimulus may thus interfere with the individual’s search. Suppose that a physically salient green diamond attracts attention away from the search target, a red circle, say. But suppose also that the distractor diamond does not capture attention. The distractor merely deflects the attention shift, bends the trajectory of the shift, without ever fully disrupting the shift. Despite the exogenous system’s interference, the individual still guides her shift toward the search target. Godijin and Theeuwes identify different types of ‘bent’ trajectories for attention shifts that result from the exogenous system’s interference with attention shifts during visual search.[31] Again, the exogenous system’s influence clearly does not appropriately correlate with the factor that realizes guidance.

For both types of drawn attention, we recognize the exogenous system’s influence as not correlating with individuals’ guidance. Cases of interference with individuals’ search are especially clear in this respect. Whatever factor realizes individuals’ guidance does not normally also interfere with it. When we intuitively recognize some psychological factor as interfering with an individual’s guidance of a process, we have reason to think that that factor cannot be the correlate of individuals’ guidance. In the next section I will appeal to drawn attention in order to rule out a range of endogenous factors as realizers of guidance.


3.1.3 Shifts Drawn by Non-Guiding Endogenous Factors

Psychologists consider an extremely wide range of factors to be ‘endogenous.’ Not all these factors plausibly realize the individual’s guidance. Some of these factors, too, draw attention. For example, individuals can be primed to be more sensitive and react more readily to certain stimuli. If an individual repeatedly searches for a green diamond, she will get faster at finding green diamonds. Once primed for green diamonds, the individual will find it more difficult to ignore green diamonds during subsequent searches – even if now she is looking for a red circle. Priming can draw attention to some type of stimulus.[32] Also, individuals store statistical information about configurations in visual scenes. When a search target consistently appears at a certain distance and angle from, say, a heptagon of vertical bars, individuals more readily shift attention to a location at the same distance and angle from such a configuration. The location attracts the individual’s attention even if the configuration is no longer indicative of the search target. The memory for the configuration draws attention to that location.[33] We obtain similar effects due to associations of past reward with types of stimuli, and due to large amounts of information about objects and scenes stored in long-term memory.[34] Even working memory can draw attention. Suppose that an individual maintains a red circle in visual working memory. This memory may then influence subsequent search for a green diamond. Suppose that an individual is told to search a display that contains the target, a green diamond, but also a range of distractors. Suppose that one of the distractors is a red circle, matching the representation maintained in visual working memory. Soto et al. provided experimental evidence that the distractor will draw the individual’s guided shifts during her search.[35] Search times increase and the shifts decrease in accuracy. The influence of the red circle stored in working memory interferes with the individual’s search. Clearly, the memory of the red circle does not realize the individual’s guidance.

All these endogenous factors draw attention. We intuitively recognize them as not realizing the individual’s guidance. We furthermore recognize them as interfering with individuals’ guidance. These factors hence do not plausibly correlate with individuals’ guidance. Which endogenous factor plays this role?


3.1.4 Central Executive Control as the Actual Correlate of Guidance

Psychological research suggests that central executive control appropriately correlates with exercises of individuals’ guidance. A study by Olivers and Eimer[36] illustrates this research. They performed experiments similar to those sketched in the last sub-section. In their studies, too, individuals had to memorize a color, perform a visual search, and a memory test. The memorized color influenced attention shifts during a subsequent search. When their subjects could not predict whether they would first complete the search or the memory test, individuals pursued both goals simultaneously. Olivers and Eimer found that the effect of the memorized color on subsequent search doubled, relative to the condition in which individuals could predict which task they would have to complete next. Olivers and Eimer explained this result as due to the allocation of central resources to both representations in working memory. When individuals guide their search on the basis of a representation of the search-goal in working memory, then their central executive system allocates central resources to that representation. This allocation increases the representation’s influence on the priority map and hence subsequent search. When working memory interferes with search, no central resources activate the memory as a search goal. Experiments of this type support the idea that the central executive system’s allocation of resources to some specific process correlates with individual’s guidance of attention shifts to locations.

There is converging evidence for all three signature executive functions in visual search. Switching of mental set is required to abandon a task and initiate a visual search. Walther and Fei-Fei showed that visual search exhibits typical effects of switching.[37] They asked subjects to switch back and forth between the task of searching for a target in a display, and that of reporting the color of the display’s frame. The central executive system takes between 200 ms and 800 ms to switch tasks. Individuals’ performance on the second task was only impaired when they had less than 200 ms to switch sets. These experiments support the claim that initiating a visual search requires switching. Maintenance of relevant representations in memory is required for the execution of visual search. For example, Oh and Kim showed that when individuals had to memorize locations of four squares on a screen, a subsequent visual search slowed down.[38] The individuals were less effective at finding the search target. Oh and Kim explained their results by pointing out that these individuals’ visuo-spatial working memory was filled to capacity. If the memory storage for locations is filled, the central executive cannot effectively control assignments of priority on the priority map. Inhibition of irrelevant distractors is required to search a cluttered display. Lavie et al. found that individuals shift attention to distractors more often when the individuals have to concurrently generate random numbers or perform calculations on numbers.[39] The amount of time that they needed to find the target increased in proportion with the amount of unrelated central processing individuals had to carry out. Lavie et al. thus showed that the central executive system is needed, in visual search, to inhibit the effects of distractors.

This empirical research supports the claim that whenever primates guide their attention shifts, then these individuals’ central executive systems control the shifts of visual attention. Earlier I argued that the central executive system does not control shifts during attentional capture. Those shifts are entirely controlled by the exogenous system. I have also argued that the endogenous factors that draw attention are not exercises of central executive control. So exercises of central executive control plausibly correlate with primates’ guidance of visual attention.

How does the central executive system realize individuals’ guidance of attention shifts during visual search? Remember, again, the macaque’s visually searching for a berry. The monkey guides his attention during his search. When the monkey sets out to find a berry, the central executive system activates an iconic, picture-like memory of the berry. The central executive system implements the monkey’s goal by prioritizing the memory of the berry as the template for computations of priority on the priority map. This input governs the computations of priority for shifting attention to specific locations. When the monkey begins his search, the central executive system initiates computations that yield a priority assignment for possible target locations on the priority map. The computational mechanisms underlying the priority map prioritize locations on the basis of how visually similar the target, stored in memory, is to items at locations in the visual scene. Suppose that there was no berry where the monkey shifted attention first. When the monkey shifts attention to the next location the priority map represents this new location as the next-most likely location for the target. The process repeats until the search terminates. When the monkey finds the berry, the visual attentional system computes a sufficient match between the item at a searched location and the iconic memory of the berry activated by the central executive system. The central executive system may now switch mental set from the goal of finding the berry to that of reaching for it.

I conclude that there is strong empirical support for the claim that central executive control correlates with primates’ guidance of attention shifts.

3.2 Central Executive Control as Explanatory of Primates’ Guidance of Visual Attention

An explication must help ground explanations of what it explicates. I now want to reflect on a way in which my explication furthers our understanding of the notion of guidance. Appeal to central executive control helps deepen our understanding of individuals’ guidance by making the concept shaper, more precise. Reflection on the notion individuals’ guidance suggests several marks that we associate with this notion. Such marks do not constitute necessary or sufficient conditions on guidance. Rather, these marks represent salient aspects of central instances of individuals’ guidance. We can see how appeal to central executive control helps explain individuals’ guidance, if we can see that central executive control can realize salient aspects of individual’s guidance. Appeal to the scientific explanation of these aspects’ realization sharpens the notion’s pre-theoretical marks.

What is an everyday example of a good guide? Consider the example of a mountain guide. First, the good guide sets the goal for the expedition’s ascent. She picks a peak as her expedition’s destination and orients her expedition toward the peak. Second, the good mountain guide has relevant organizational competencies. She brings the map, rope, pickax, and crampons. She knows how to use them. The good mountain guide is able to negotiate the terrain that the expedition crosses. She knows the route to that peak. Third, a good mountain guide uses these competencies, as needed, to stably steer toward the goal of the expedition. She gets the crampons out when the expedition crosses ice. If necessary, she quells resistance on a stretch that strikes her companions as unduly strenuous. Appeals to central executive control help us understand individuals’ guidance by connecting it with a kind that appears in psychological explanations. Reflection on this psychological kind reveals how central executive control realizes the pre-theoretic marks of the notion guidance laid out in the foregoing paragraph. Such reflection yields empirical conditions that realize aspects of guidance, and hence plausibly guidance itself, in actual psychologies. Central executive control realizes the individuals’ pursuit of a goal. The system organizes competencies and resources for attaining the goal. And it exercises these competencies, as needed, to stably steer toward that goal. Remember the case of visual search. First, the central executive system implements the individual’s goal by activating a representation in working memory. Second, the central executive system realizes organizational competencies for attaining the individual’s goal insofar as it activates or holds available storage and processing resources. Third, the central executive system, when functioning well, exercises its executive functions – that is, allocates resources – in ways that further the individual’s goal-directed activity.

Psychology can help explain guidance’s marks by appeal to central executive functions – switching, maintenance, and inhibition – that characterize the central executive system. Psychology offers fairly sharp criteria that characterize each executive function and confirm its exercise. The scientific notions switching, maintenance, and inhibition are certainly more sharply characterized than our pre-theoretic marks of individuals’ guidance. Appeal to these scientific notions in explanations of the marks of guidance hence helps sharpens the latter notion.

Central executive control furthers our understanding of individuals’ guidance in similar fashion. Appeal to central executive control helps illuminate conditions for attributing guidance to individuals, rather than their subsystems alone. We should first note that the central executive system is a prima facie plausible candidate for realizing guidance by the individual, because the central executive system helps realize instances of processes that are attributable to the whole individual, such as instances of problem solving, inference, and attention shifts. Moreover, the central executive system functions to integrate and coordinate individuals’ states and competencies for the successful execution of individual-level processes. The system organizes and activates resources for a process. The system coordinates individual-level processes both simultaneously and through time. By integrating capacities for an individual-level process, the central executive system unifies the activities and products of the individual’s different psychological sub-systems and enables a process at the level of the whole individual. In particular, the central executive system can perform these functions for attaining individuals’ goals. The central executive system has a function to implement individuals’ goal-represented processes. The integration and coordination achieved by the central executive system serves, in central cases, the attainment of individuals’ goals. In explanations of many other processes, psychology does not appeal to goals of the individual. Computations of edge-representations from representations of illumination gradients, for example, are normally explained in terms of transformations by the visual system, without influence from individuals’ goals. Central executive control thus marks a level of integration and coordination that helps unify psychological processes in the individual’s sub-systems so as to generate the individual’s activities, directed at the individual’s goals. This system contributes to unifying psychological processing in different psychological sub-systems so as to make them an individual’s goal-represented activities. Because central executive control marks such a level of integration and coordination, the central executive system is a plausible realizer of individuals’ guidance.

Psychology’s explanations of processes in terms of central executive functions are more rigorous and precise than our pre-theoretic notion of individuals’ guidance. As before, appeal to these scientific notions in explaining marks of individuals’ guidance hence yields a sharpening of the latter notion. So we can see how appeal to an empirical condition helps deepen our understanding of guidance by the individual by making the concept shaper, more precise.

3.3. Executive Control as Explicating Primates’ Guidance of Visual Attention

I have argued that central executive control correlates with primates’ guidance. I have also argued that appeal to central executive control helps us better understand guidance. I conclude that

Guidance of Visual Attention

Central executive control realizes primates’ guidance of their visual attention shifts.

My argument appealed to three factors. It was driven by our intuitions about cases. I also appealed to pre-theoretic marks of our concept guidance. Finally, I appealed to our scientific knowledge of the processes that occur when individuals guide their acts. I did not merely suggest replacing our pre-theoretic notion with a scientific notion. Nor did lay down pre-theoretic marks of guidance as armchair criteria for any mechanism that is to realize individuals’ guidance. My claim is rather that, taking into account all three factors, our best explanation of primates’ guidance of visual attention appeals to central executive control. We can explicate individuals’ guidance by appeal to this explanation.


4 The Constitution of Agency

Earlier I promised that I would use my empirical explication of guidance to motivate a novel account of agency. I promised that I would sketch how such an account might help us address the comprehensiveness- and the attributability-constraint. I want to return to these promises now. An account of agency provides constitutive conditions on the kind agency. Constitutive conditions on a kind are “conditions that are necessary, sufficient, or necessary and sufficient to be something of [a] kind or with [a] nature, and [they] are in principle potentially relevant to explaining, understanding, illuminating the kind or nature.”[40] In what way does Guidance of Visual Attention help uncover constitutive conditions on agency?

4.1 Guidance in Primates

I begin by motivating an account of guidance in primates. I have argued that appeals to central executive control help us explicate individuals’ guidance. There is ample evidence that the central executive system controls primates’ acts in many other cases, including central instances of mental and bodily action.[41] The evidence makes it plausible that central executive control correlates with all exercises of primates’ guidance in the actual world. The latter observation, in turn, lends plausibility to the claim that

Guidance in Primates

Central executive control (partly) constitutes primates’ guidance.

Suppose that the meaning of concepts such as agency and individuals’ guidance is partly determined by relations to their denotation, just as in the case of water and life. Just as in these latter two cases, providing a constitutive explication for agency and individuals’ guidance would accordingly require investigating the respective kinds in the actual world. Further suppose that central executive control actually correlates with primates’ guidance. Pending considerations to the contrary there would then be some reason to think that central executive control constitutes a necessary condition on primates’ guidance. Each of these claims requires extensive support than I cannot provide here. But they do have some antecedent plausibility. Earlier I explained how appeal to central executive control deepens our understanding of individuals’ guidance.[42] The same considerations apply to Guidance in Primates. If central executive control is a necessary condition on primates’ guidance, and if appeal to central executive control indeed deepens our understanding of primates’ guidance, then central executive control is a constitutive condition on such individuals’ guidance. So there is some reason to think that central executive control partly constitutes primates’ guidance.

4.2 Addressing the Attributability-Constraint

This constitutive explication of primates’ guidance would constitute progress over recent action theories as regards the attributability-constraint. This constraint requires explaining under which conditions the guidance of a process is attributable to the whole individual, as opposed to merely one of her sub-systems. The central executive system functions to integrate and coordinate activity in different psychological sub-systems so as to yield individual-level processes. Appeal to this system plausibly helps us understand why the guidance of a process is attributable to the whole individual, not merely one of her sub-systems.

Guidance in Primates hence does justice to the basic insight of agent-causation accounts of agency: that in order to explain agency we must acknowledge the agent’s role in acting. The explication does so without being committed to an unscientific form of causation. A mark of the agent’s involvement in her act would be the exercise of central executive control. Guidance in Primates might be seen as a way of specifying the nature of tryings. We might consider a trying to be the allocation of central executive resources to some process. If the main problem with trying-based accounts was that they do not specify the nature of tryings, my proposal constitutes progress. The appeal to empirical science constitutes genuine explanatory progress in both cases.

Guidance in Primates further promises a solution for the difficulties facing the reflection-based, intention-based, and goal-based accounts of agency. It seems likely that sometimes, we guide our acts through reflection. At other times, plausibly, a thought or propositional intention guides our doings. And equally, it seems plausible that under certain circumstances an image-like representation alone can implement individuals’ guidance. Under what circumstances do the respective states realize an individuals’ guidance? When do reflection, intentions, or goal-representations merely causally influence an individual’s acts? When do image-like goal-representing states contribute to an action at the sub-individual level alone? Guidance in Primates suggests an answer to these questions. In visual search we have seen that the central executive system controls processes by activating an image-like search goal in working memory. The activation of the search goal consists in the central executive system’s allocation of central resources to that search goal. We might say that the individual then guides his search through an image-like goal-representation. The allocation of central executive control to goal-representation and search process marks that representation’s role as a guider of the search. Absent such allocation of central resources, the goal-representation at best causally influences the individual’s guidance. If a goal-representing state is beyond the central executive system’s reach, that state may occur at the level of a sub-system alone.[43] And a similar explanation is now available for cases in which an individual’s reflection or a propositional intention helps guide her act. We might say that the individual guides his search through reflection or intention whenever her central executive system activates – allocates resources to – a reflective state or an intention in similar ways. What unifies different exercises of agency, on this picture, is the central executive system’s control. What distinguish different exercises of agency are the capacities and states through which an act is guided.

In these ways, Guidance in Primates promises to contribute to understanding responsible agency. As I said earlier, not all primates are answerable for their acts. Responsibility-based accounts hence misidentify what constitutes agency. But they do draw attention to a particularly important type of agency: agency that individuals are answerable for. Central executive control is plausibly not sufficient for holding individuals responsible for their acts – monkeys will hardly be held responsible for their doings. But central executive control may well be necessary for the kinds of criticisms and assessments that the responsibility-based accounts invoke to explicate agency. By addressing the attributability-constraint in the way sketched, my proposal promises to integrate important lessons from all major accounts of agency.

5.3 Addressing the Comprehensiveness-Constraint

Appeal to central executive control will most likely not, however, help us meet the comprehensiveness-constraint. This constraint requires that an account of agency should endeavor to explain all instances of agency. An account of agency should at least hold some promise to account for exercises of agency across all species. Guidance in Primates promises to account for guidance in a much wider range of animals than most of the proposals that have recently been made in action theory. We know that primates have a central executive system. Many animals, such as mammals and possibly birds, exhibit behavior that is best explained by appeal to executive functions.[44] But for many other actual agents it is at best unclear whether they have a central executive system. We do not know, at this point, whether the behavior of sharks and spiders exhibits the three executive functions switching, maintenance, and inhibition. Animals such as jellyfish, worms, and ciliates may not even have psychologies.[45] So it seems that these animals could not have the psychological competencies that characterize the central executive system. Reflection on guidance in primates does, however, motivate a conjecture about the constitution of agency, which respects the comprehensiveness-constraint:[46]

Constitution of Guidance

The control of a process by an individual’s central structures (partly) constitutes that individual’s guidance.

‘Central structures’ are structures that function to integrate and coordinate the activity of individuals’ sub-systems. Such structures ‘control’ a process if they actually determine and shape an individual-level process by integrating and coordinating the activity of sub-systems. The conjecture leaves open whether such central structures are psychological structures or of some other nature. The conjecture hence involves no commitment to specific psychological competences on an agent’s part. The conjecture allows for agents that do not have a psychology at all. Indeed, the conjecture allows that the nature of the relevant central structures and the ways in which they control activity may differ from kind of agent to kind of agent. So the conjecture meets the comprehensiveness-constraint in that it at least promises to apply to all agents, across species.

Why might we think that such central structures’ control might constitute individuals’ guidance? First, reflection on actual agents provides initial support for the claim. Attributions of a central executive system to primates, possibly mammals, and maybe even birds, seem empirically plausible. Mammals and birds certainly seem to have a psychological system functioning as an integrator and coordinator of these animals’ activities. Central structures of sorts can be found in arthropods, cephalopods, and many fish. One may even argue that the membrane of single-cell organisms such as ciliates constitutes a central structure.[47] A cursory overview of actual agents at least suggests that they all have central structures of the kind that the conjecture appeals to.

Second, reflection on the evolution of actual agents provides some support for the claim that they all have central structures. The agents that we know seem to exhibit some degree of complexity. These animals have different types of behaviors available. They typically are capable of processing different types of stimulation. Which stimulus they process and which behavior they exhibit typically depends on their internal state. From an evolutionary point of view, it would seem beneficial for these agents, if they had central structures that integrate and coordinate stimuli from different sources with their internal state. The agents would seem to benefit from these structures’ selecting some specific type of behavior on that basis. So it would seem beneficial if these animals had central integrating and coordinating structures.

Third, armchair reflection suggests that agents should exhibit a certain unity. Actions are doings by agents. Many agents – and plausibly all actual agents – draw upon different capacities for their actions. Such agents draw upon different behavioral abilities, different perceptual capacities, and different types of motivation. In virtue of what are these different processes unified so as to yield the agent’s act? It seems plausible that some central structure that coordinates and integrates the activities of relevant sub-systems should yield this unification. Different sub-systems can contribute to each of an individual’s acts. But the contributions of different sub-systems would be coordinated and integrated by the same central structure, in each act. Appeal to such a central structure would help us explain a sense in which agents exhibit unity.

I offer Constitution of Guidance as a conjecture about the constitution of agency. I hope to verify the conjecture in future work.

5 Conclusion

In this paper I have argued that we can explicate individuals’ guidance by appeal to central executive control. I have suggested that exercises of primates’ guidance constitutively involve central executive control. And I have conjectured that control by integrating and coordinating central structures is a constitutive condition on all agency. I have developed my explication(s) by reflecting on a close examination of empirical research on episodes in which individuals guide their acts. I thus hope to have illustrated the benefits of such a methodology in action theory.




Anderson, B. 2013. “A Value-Driven Mechanism of Attentional Selection.” Journal of Vision 13(3): 1-16

Anscombe, G.E.M. 1953. Intention. Cambridge, MA: Harvard University Press

Baddeley, A. D. 2007. Working Memory, Thought, and Action. Oxford: Oxford University Press.

Barth, F. 2002. A Spider’s World. Senses and Behavior. Springer: Heidelberg.

Bishop, J. 1989. Natural Agency. An Essay on the Causal Theory of Action. Cambridge: Cambridge University Press.

Block, N. & Stalknaker, R. 1999, ”Conceptual Analysis, Dualism, and the Explanatory Gap.” Philosophical Review 108: 1-46

Brand, M. 1984. Intending and Acting. Cambridge, MA: MIT Press.

Bratman, M. 1987. Intentions, Plans, and Practical Reason. Cambridge, MA: Harvard University Press.

Buehler, D. [MS]. “Animal Agency.”

Buehler, D. [MS]. “Unsophisticated Agency.”

Buehler, D. [MS]. “The Individual/Sub-System Distinction.”

Buehler, D. 2014, Psychological Agency – Guidance of Visual Attention, UCLA Dissertation

Burge, T. 1979. “Individualism and the Mental.” in T. Burge (2007), Foundations of Mind (pp. 392-419), OUP: Oxford

Burge, T. 2007, Foundations of Mind, OUP: Oxford

Burge, T. 2009, “Primitive Agency and Natural Norms,’ Philosophy and Phenomenological Research 79: 251-287

Burge, T. 2010, Origins of Objectivity, OUP: Oxford

Burge, T. 2010a. “Steps Towards Origins of Propositional Thought.” Disputatio 4(29): 39-67

Carnap, R. 1947, Meaning and Necessity, UCP: Chicago

Carrasco, M. 2011. “Visual Attention: The Past 25 Years.” Vision Research (51): 1488

Chisholm, R. 1976. Person and Object. A Metaphysical Study. La Salle, IL: Open Court Publishing.

Chisholm, R. 1978. “Comments and Replies.” Philosophia 7: 597-636

Chun, M. & Nakayama, K. 2000. “On the Functional Role of Implicit Visual Memory for the Adaptive Deployment of Attention across Scenes.” Visual Cognition 7(1/2/3): 65-81

Davidson, D. 1980. Essays on Actions and Events. Oxford: Oxford University Press.

Emery, N. & Clayton, N. 2004. “Comparing the Complex Cognition of Birds and Primates.” in L. Rogers & G. Kaplan, Comparative Vertebrate Cognition (pp. 3-55), Springer: New York

Folk, C., Remington, R. & Johnston, C. 1992. “Involuntary Covert Orienting is Contingent on Attentional Control Settings.” Journal of Experimental Psychology: Human Perception and Performance 18(4): 1030-1044

Frankfurt, H. 1978, “The Problem of Action,” in his The Importance of What We Care About, CUP: New York (1988)

Gazzaley, A. & D’Esposito, M. 2006. “Unifying the Prefrontal Cortex: Executive Control, Neural Networks, and Top-Down Modulation.” In B. Miller & J. Cummings (eds.), The Human Frontal Lobes. Functions and Disorders. (pp. 187-207) New York: Guilford Press.


Geisler, W. & Cormack, L. 2011. “Models of Overt Attention.” In S. Liversedge, I. Gilchrist & S. Everling (eds.), The Oxford Handbook of Eye Movements (pp. 439 – 454), OUP: New York

Godijn, R. & Theeuwes, J. 2003. “The Relationship between Exogenous and Endogenous Saccades and Attention.” In J. Hyona, R. Radach & H. Deubel (eds.), The Mind’s Eyes: Cognitive and Applied Aspects of Eye Movements (pp. 3-26), Elsevier: Amsterdam

Goldman, A. 1970. A Theory of Human Action. Princeton, NJ: Princeton University Press

Güntürkün, O. 2005. ”The Avian ’Prefrontal Cortex’ and Cognition.” Current Opinion in Neurobiology 15: 686-693

Hieronymi, P. 2009. “Two Kinds of Mental Agency.” In Lucy O’Brien & Matthew Soteriou (eds.), Mental Actions, Oxford: Oxford University Press.

Hollingworth, A. 2014. “Guidance of Visual Search by Memory and Knowledge.” In M. Dodd & J. Flowers (eds.), The Influence of Attention, Learning, and Motivation on Visual Search, Nebraska Symposion on Motivation

Hornsby, J. 1980. Actions. London: Routledge.

Kripke, S. 1980. Naming and Necessity. Harvard University Press: Cambridge, MA

Kristjansson, A. & Campana, G. 2010. “Where Perception Meets Memory: A Review of Repetition Priming in Visual Search Tasks.” Attention, Perception & Psychophysics 72(1): 5-18

Korsgaard, C. 2009. Self-Constitution. Agency, Identity, and Integrity. Oxford: Oxford University Press.

Lavie, N., Hirst, A., De Fockert, J. & Viding, E. 2004. “Load Theory of Attention and Cognitive Control.” Journal of Experimental Psychology: General 133(3): 339-354

Lavie, N. & Dalton, P. 2014. “Load Theory of Attention and Cognitive Control.” In A. Nobre & S. Kastner (eds.), The Oxford Handbook of Attention (pp. 56-75), Oxford: Oxford University Press.

Leonard, C. & Egeth, H. 2008. “Attentional Guidance in Singleton Search: An Examination of Top-Down, Bottom-Up, and Intertrial Factors.” Visual Cognition 16(8): 1078-1091

McDowell, J. 1994. Mind and World. Cambridge, MA: Harvard University Press.

McDowell, J. 2007. “What Myth?” Inquiry 50(4): 338-351

McDowell, J. 2010. “What is the Content of an Intention in Action?” Ratio 23: 415-432

Mele, A. 1992. The Springs of Action. Understanding Intentional Behavior. Oxford: Oxford University Press

Miller, E. & Cohen, J. 2001. ”An Integrative Theory of Prefrontal Cortex Function.” Annual Review of Neuroscience 24: 167-202

Miyake, A., Friedman, N., Emerson, M., Witzki, A., Howerter, A. & Wager, T. 2000. “The Unity and Diversity of Executive Functions and Their Contributions to Complex “Frontal Lobe” Tasks: A Latent Variable Analysis.” Cognitive Psychology 41: 49-1

Munakata, Y., Hard, S., Chatham, C., Depue, B., Banich, M. & O’Reilly, R. 2011. ”A Unified Frameworke for Inhibitory Control.” Trends in Cognitive Sciences 15: 453-459

Munakata, Y., Snyder, H. & Chatham, C. 2012. ”Developing Cognitive Control: Three Key Transitions,” Current Directions in Psychological Science 21(2): 71-77

Nanay, B. 2014, Between Perception and Action, OUP: Oxford.

Nixon, M. & Young, J. 2003. The Brains and Lives of Cephalopods. OUP: Oxford.

Oh, S. & Kim, M. 2004. “The Role of Spatial Working Memory in Visual Search Efficiency.” Psychonomic Bulletin & Review 11(2): 275-281

Olivers, C. & Eimer, M. 2011. “On the Difference between Working Memory and Attentional Set.” Neuropsychologia 49: 1553-1558

Olivers, C., Peters, J., Houtkamp, R. & Roelfsma, P. 2011. “Different States in Visual Working Memory: When it Guides Attention and When it does Not.” Trends in Cognitive Sciences 15(7): 327-334

O’Reilly, R. & Frank, M. 2006. ”Making Working Memory Work: A Computational Model of Learning in the Frontal Cortex and Basal Ganglia.” Neural Computation 18: 283-328

O’Shaughnessy, B. 1973. “Trying (as the mental “pineal gland”)” The Journal of Philosophy 70(13): 365-386

O’Shaughnessy, B. 1980. The Will. A Dual Aspect Theory. Cambridge: Cambridge University Press.

Peacocke, C. 2007. “Mental Action and Self-Awareness (I).” In J. Cohen & B. McLaughlin (eds.), Contemporary Debates in the Philosophy of Mind, Oxford: Blackwell.

Pears, D. 1975. “Sketch for a Causal Theory of Wanting and Doing.” In D. Pears, Questions in the Philosophy of Mind (pp. 97-141), London: Duckworth.

Posner, M. 1980. “Orienting of Attention.” Quarterly Journal of Experimental Psychology 32

Putnam, H. 1970. “Is Semantics Possible?” In H. Putnam (1975), Mind, Language, and Reality. Philosophical Papers Vol. 2 (139-152), CUP

Quine, W.V.O. 1960, Word and Object, MIT Press: Cambridge, MA

Searle, J. 1983. Intentionality. An Essay in the Philosophy of Mind. Cambridge: Cambridge University Press.

Shadmehr, R. & Wise, S. 2005. The Computational Neurobiology of Reaching and Pointing. A Foundation for Motor Learning. Cambridge: MIT Press.

Sleigh, M. 1975. The Biology of Protozoa. Edward Arnold: London.

Soto, D., Humphreys, G. & Heinke, D. 2006. “Working Memory can Guide Pop-Out Search.” Vision Research 46: 1010-1018

Soto, D., Hodsoll, J. Rotshtein, P. & Humphreys, G. 2008. “Automatic Guidance of Attention from Working Memory.” Trends in Cognitive Science 12(9): 342-348

Stokes, M. & Duncan, J. 2014. “Dynamic Brain States for Preparatory Attention and Working Memory.” In A. Nobre & S. Kastner (eds.), The Oxford Handbook of Attention (pp. 152-182), Oxford: Oxford University Press

Theeuwes, J. 1991. “Exogenous and Endogenous Control of Attention: The Effect of Visual Onsets and Offsets.” Perception & Psychophysics 49(1): 83-90

Todd P., Hills, T. & Robbins, T. (eds.) 2012, Cognitive Search: Evolution, Algorithms, and the Brain, MIT Press: Cambridge, MA

Velleman, D. J. 2000. The Possibility of Practical Reason. Oxford: Oxford University Press

Walther, D. & Fei-Fei, Li 2007. “Task-Set Switching with Natural Scenes: Measuring the Cost of Deploying Top-Down Attention.” Journal of Vision 7(11): 1-12



[1]           My special thanks to Tyler Burge. I have benefitted from comments by Harry G. Frankfurt, Pamela Hieronymi, Michael Rescorla, Miguel Ángel Sebastián, James Stazicker, and participants of the UCLA Mind and Language Workshop and UNAM-IIF’s TEC discussion group. Thanks also to audiences at UCLA, Indiana University Bloomington, and at the National Research University in Moscow.

[2]           I indicate representational contents by underlining them.

[3]           Frankfurt, H. 1978

[4]           Burge, T. 2007, 163ff.; Cf. also Carnap, R. 1947; Quine, W.V.O. 1960, Chapter V.

[5]           Here are two examples of empirically grounded explications that deepen conceptual understanding: Some contracts are oral and Whales are mammals. In both cases, the explication of the concepts provides a condition on being the relevant kind contract or whale. In both cases, the relevant explication depends on empirical information about whales and contracts for its warrant.

[6]           Harry Frankfurt (personal communication) agrees that the lack of progress for the exclusively armchair methodology suggests appeal to empirical knowledge as a viable alternative.

[7]           James Stazicker helpfully suggested describing the method as an attempt to achieve a reflective equilibrium between intuitions about cases, armchair reflection on the concept, and empirical research.

[8]           This methodology has led to major advances in other parts of philosophy, especially in the philosophy of perception. Cf. Burge, T. 2010. So far, we lack an application of this methodology in action theory.

[9]           A longer version of this paper contains a section addressing several objections.

[10]          Frankfurt, H. 1978

[11]          For the most powerful recent criticism along these lines, see Burge, T. 2009.

[12]          Velleman, D. 2000, 30; Korsgaard, C. 2009, 26.

[13]          Anscombe, G.E.M. 1953, 9; see also Hieronymi, P. 2009, 138

[14]          Davidson, D. 1980, 46. Ibid., Essays 1 and 5.; Searle, J. 1983; Goldman, A. 1970; McDowell, J. 1994; McDowell, J. 2007; Mele, A. 1992; Bratman, M. 1987; Brand, M. 1984; Bishop, J. 1989; Pears, D. 1975. I intend for this family to include accounts on which the act-constituting state is a belief-desire pair, a conceptual desire, or a reason.

[15]          Mele, A. 1992, 222; cf. also Nanay, B. 2014.

[16]          Cf. Velleman, D. 2000, 127& 137. Our deepest intuition about agency is that actions are doings by agents, that actions have agents as their source. We find this intuition expressed throughout the history of philosophy, e.g. by Aristotle (DA II.4, 415b10/2; MA 9, 703a1/3), or Kant (B567/9; Lectures on Metaphysics, 29:822 & 29:903).

[17]          Burge 2010, 469ff.

[18]          Shadmehr, R. & Wise, S. 2005, Chapter 12, especially p. 225.

[19]          The same demand applies to reflection- and intention-based accounts. It is related to the problem of deviant chains. Cf. Bishop, J. 1989.

[20]          Chisholm, R. 1978, 627; also his Chisholm, R. 1976

[21]          O’Shaughnessy, B. 1973, 369; O’Shaughnessy, B. 1980; Peacocke, C. 2007; Hornsby, J. 1980.

[22]          Miyake, A. et. al. 2000; Miller, E. & Cohen, J. 2001; Munakata, Y. et al. 2011

[23]          Burge, T. 2010a

[24]          Cf. e.g. O’Reilly, R. & Frank, M. 2006; Gazzaley, A. & D’Esposito, M. 2006

[25]          Posner, M. 1980

[26]          Carrasco, M. 2011

[27]          Geisler, W. & Cormack, L. 2011

[28]          Cf. e.g. Theeuwes, J. 1991

[29]          Folk, C., Remington, R. & Johnston, C. 1992

[30]          The speed of their shifts toward the diamond exceeds that in cases of capture, where the suddenly appearing stimulus does not coincide with the search goal. Cf. Leonard, C. & Egeth, H. 2008.

[31]          Godijn, R. & Theeuwes, J. 2003

[32]          Kristjansson, A. & Campana, G. 2010

[33]          Chun, M. & Nakayama, K. 2000

[34]          Anderson, B. 2013; Hollingworth, A. 2014

[35]          Soto, D., Humphreys, G. & Heinke, D. 2006; Soto, D., Hodsoll, J. Rotshtein, P. & Humphreys, G. 2008.

[36]          Olivers, C. & Eimer, M. 2011; cf. also Olivers, C., Peters, J., Houtkamp, R. & Roelfsma, P. 2011; Stokes, M. & Duncan, J. 2014

[37]          Walther, D. & Fei-Fei, Li 2007

[38]          Oh, S. & Kim, M. 2004

[39]          Lavie, N., Hirst, A., De Fockert, J. & Viding, E. 2004; Lavie, N. & Dalton, P. 2014

[40]          Burge, T. 2010, 58

[41]          Cf. Baddeley, A. D. 2007; Miller, E. & Cohen, J. 2001; Munakata, Y. et al. 2011; Munakata, Y. et al. 2012.

[42]          Section 3.2

[43]          I cannot here do justice to the complexities of the relation between central executive control and attributability to individuals. I address the issue in my “The Individual/Sub-System Distinction [MS].”

[44]          Todd P. et al. 2012; Güntürkün, O. 2005; Emery, N. & Clayton, N. 2004

[45]          Burge, T. 2010

[46]          Cf. Burge, T. 2009. I more fully investigate this conjecture in my ‘Animal Agency [MS].’

[47]          Nixon, M. & Young, J. 2003; Barth, F. 2002; Sleigh, M. 1975

28 thoughts on “Guidance of Visual Attention”

  1. Introduction

    First, thanks to the conference organizers for their invitation to comment, and to Denis Buehler for his thought-provoking paper!

    The paper’s main goal is to use a concept deployed in psychological explanations (“the central executive”) to respond to a philosophical challenge raised by Harry Frankfurt: to explicate the distinction between processes that are guided or unguided by an individual (p. 1). As such, the paper is an excellent example of the fruitful kinds of engagement that can occur between psychology and philosophy.

    Here is a rough outline of the paper. First, Buehler argues that the “primary realizer” of an agent’s guidance of her shifts of visual attention is her central executive system’s control of those shifts (p. 13). He goes on to argue that his thesis about the realization of guidance helps illuminate the concept of guidance (p. 18).

    His paper raises many issues that are worth discussion. In this commentary, I’ll focus on just one. Here’s a brief preview of my worry. Because Buehler wants to use the concept of the central executive to illuminate the concept of guidance, he seems to require that we understand the concept of the central executive better than we understand the concept of guidance. But I’m not sure that we do.

    In the rest of this commentary I’ll expand on the worry, and then end with a constructive suggestion about how it might be addressed.

    1. Buehler on the central executive

    On Buehler’s view, all we can say about the central executive for now is that “the central executive system organizes and coordinates central processing and storage of resources for the execution of cognitive processes, including individuals’ goal-represented activities” (p. 14). He also identifies at least three functions carried out by this system: switching from one task to another, maintaining task-relevant memories, and inhibiting certain kinds of responses. (pp. 14-15). He says that his goal in characterizing the central executive in this way is to abstract away from the details of any specific model of the central executive, since it would be hasty to commit to the success of any particular model.

    Buehler thinks that this concept will help illuminate the concept of guidance. As he says, “[appeals] to the central executive system help us understand individuals’ guidance by connecting it with a kind – central executive control – that appears in psychological explanations” (p. 20). On this way of thinking, we can look to psychological explanations that refer to “central executive control” to learn something about guidance (some of the instances of it, anyways – guidance in creatures similar enough to actual primates).

    2. Baddeley on the central executive

    In order to make my worry about the extent to which we understand the central executive clear, I need to briefly discuss Baddeley and Hitch’s original model of the central executive.

    In 1974, Baddeley and Hitch proposed a model of working memory with three components – a visuo-spatial sketchpad, a phonological loop, and a central executive. A great deal of the current usage of the term “central executive” can, I think, be traced to their use of this term. Here is how Baddeley, in a more recent publication, has described his conception of the central executive:

    “The CE [central executive] is the most complex component of WM [working memory]. Within the original model it was assumed to be capable of attentional focus, storage, and decision making, virtually a homunculus, a little man in the head, capable of doing all the clever things that were outside the competence of the two subsystems.” (Baddeley 2012 pp.13-14)

    Baddeley refers to his conception of the central executive as homuncular; i.e., not really explanatory. But he is aware of this apparent problem, and has a reply to it:

    “Although our model tended to be criticized for taking this approach… I regard homunculi as potentially useful if used appropriately. It is important that they are not seen as providing an explanation, but rather as a marker of issues requiring explanation. Provided the various jobs performed by the homunculus are identified, they can be tackled one at a time, hopefully in due course allowing the homunculus to be pensioned off.” (Baddeley 2012 p.14)

    So on Baddeley’s approach, the term “central executive” shouldn’t actually show up in any complete psychological explanation. Instead, it’s the job of some psychological explanations to explain various aspects of the central executive, and eventually, collectively, to give us reason to do away with the concept. One psychologist – Logie – has even recently argued that we’ve arrived at the time to do away with the concept in his recent paper “Retiring the central executive” (2016).

    Miyake et al., who Buehler cites in footnote 25 as a source on the central executive, seem to approach the central executive like Baddeley. They say that their paper is meant to address the problem that “the field still lacks a compelling theory of executive functions – general purpose control mechanisms…” (Miyake et al. 2000 p. 50). Therefore, they seem to think that the central executive is something to be explained rather than something can be used to explain other phenomena.

    This seems like a fine way for psychologists to understand the central executive system. However, it seems to me that this approach to the central executive is in tension with Buehler’s project. Buehler wants to use the central executive to explain guidance. But if the central executive is itself something that is in need of explanation, it seems to me that this explanatory strategy might not be successful. It we want to explain guidance, then we should advert to something which does not itself require explanation – and especially not something that psychologists intend to eventually explain away. Instead, we should advert to something already well understood, or something that we have good reason to think will be easier to explain than guidance.

    3. Moving Forward

    I’d like to finish things off by outlining, in a very rough form, one possible method of addressing the problem I’ve raised. It’s modelled off of Wayne Wu’s (2014) approach to perceptual attention.

    To put it quickly, Wu thinks that despite apparent disagreements between psychologists about what perceptual attention “really is”, an analysis of the experimental practice of many psychologists shows that there is an account of perceptual attention that they tacitly agree on (Wu 2014 pp. 38-40).

    On this kind of approach, we don’t learn about perceptual attention just by looking at what is said by psychological explanations that deploy the term “perceptual attention”. Instead, we look at those explanations, but we should also keep in mind how well they cohere with the best account of what many psychologists really mean when they say “perceptual attention”. This kind of approach is empirically informed, but critical and careful in a particular respect.

    Perhaps analogous work on the term “central executive” would help Buehler address the worry I’ve raised here. It might be that the best analysis of what certain psychologists have to say about the central executive gets us the result that the central executive is better understood than Baddeley thinks it is, or at least not a concept that should eventually be done away with.


    Baddeley, A.D. (2012) “Working Memory: Theories, Models, and Controversies”. Annu. Rev. Psychol. 63:1–29
    Baddeley, A.D. & Hitch, G.J. (1974). Working memory. In G.A. Bower (Ed.), Psychology of learning and motivation (Vol. 8, pp. 47–90). New York: Academic Press.
    Logie, R. H. (2016). “Retiring the central executive”. Q J Exp Psychol. 19:1-17.
    Miyake, A., Friedman, N. P., Emerson, M. J., Witzki, A. H., Howerter, A., & Wager, T. D. (2000). The unity and diversity of executive functions and their contributions to complex “frontal lobe” tasks: A latent variable analysis. Cognitive Psychology, 41: 49–100.
    Wu, W. (2014). Attention. New York: Routledge.

  2. Introduction

    I am grateful to the organisers of the Minds Online conference for inviting me to comment on Denis Buehler’s excellent paper. Buehler contributes to action theory by developing an empirical explication of individuals’ guidance. The specific empirical case study he focuses on is the psychological account of how shifts of attention in visual search are controlled by cognitive mechanisms. He concludes that the controlling mechanism that realises individuals’ guidance of shifts of attention in primates is the central executive system. From this, Buehler argues for the more general claim that the “control of a process by an individual’s central structures (partly) constitutes that individual’s guidance” (Section 1, Paragraph 11).

    I am sympathetic to the general constitutive claim Buehler offers, and agree that controlling capacities of an organism’s central structures play an important role in how the organism guides its behaviour. However, I have worries with regard to the more specific claim—concerning the relations between shifts of visual attention, central executive functions, and guidance—that Buehler builds upon to reach his conclusion, and think that Bueller has chosen a case study that does not serve his purpose. More specifically, I am not convinced by the arguments offered in the paper to the conclusion that central executive control of shifts of attention realises individuals’ guidance, since I am not convinced that a shift of attention is a guided behaviour in a relevant sense of the term.

    Shifts of attention and its controlling mechanisms

    Before introducing my worries in detail, I would like to amend Buehler’s characterisation of what psychology and neurobiology have uncovered about shifts of attention and their controlling mechanisms. Buehler purposefully abstracts away from specific computational and neural mechanisms, however, as I shall argue below, there are crucial details that seem to be in tension with his conclusion regarding attention shifts, executive functions and guidance.

    Attention comes in many different varieties, like overt or covert; focal or residual; spatial, feature-based, or object-based; endogenous or exogenous, etc. (Chun et al., 2011; Carrasco, 2011). The distinction that is relevant from the perspective of Buehler’s project is the endogenous versus exogenous distinction, since while the other terms characterise where and how attention is allocated, this distinction characterises how the allocation of attention is initiated and controlled. Endogenous attention is activated in a top-down manner by internal causes like goals, beliefs and desires, and is typically under voluntary control, whereas exogenous attention is stimulus-driven, bottom-up, involuntary and automatic. (Note, however, that Wu (2014) claims that the top-down versus bottom-up and the controlled versus automatic distinctions come apart, and Jennings (2012) argues that the endogenous versus exogenous and voluntary versus involuntary distinctions are different).

    As it has recently been argued, endogenous and exogenous attention correspond to distinct underlying networks in the brain. The functioning of attention in general is associated with the functioning of three different networks: an alerting network that maintains optimal vigilance during tasks, an executive network that is responsible for target detection for focal processing, and an orienting network that prioritises sensory input (Posner and Petersen 1990; Petersen and Posner 2012; to see how these networks are related to executive functions see e.g. Diamond, 2013).

    The so-called orienting network consists of two partly overlapping networks that can be corresponded to top-down and bottom-up orienting. The activity of the so-called dorsal frontoparietal network, i.e. frontal and parietal areas along the dorsal stream (like the intraparietal sulcus and the frontal eye field), influences perceptual processing in accordance with actual task demands, and is responsible for goal-directed stimulus-response selection. The other, more lateralised, so-called ventral frontoparietal network (including the temporoparietal junction and the ventral frontal cortex) plays a circuit-breaking role by breaking ongoing cognitive activity when a salient unexpected stimulus is detected. On the basis of these functions, the dorsal system is associated with orienting endogenous attention, whereas the ventral system is associated with orienting exogenous attention (Corbetta and Shulman, 2002; Petersen and Posner 2012; Wu 2014).

    Although Buehler claims that the “exogenous system exclusively responds to stimuli that are physically salient” (Section 3.1.1, Paragraph 1), there is empirical evidence to the contrary: the ventral frontoparietal network is sensitive to behaviourally relevant stimuli, i.e. it plays its circuit-breaking role if behaviourally relevant stimuli are detected outside focal processing. Corbetta and Shulman (2002) propose that information about behavioural relevance is mediated by connections between the ventral and the dorsal systems. That is exogenous attention is partly driven by goals, beliefs and task demands representing areas that are typically associated with top-down, voluntary, goal-directed control. Even more importantly from our present perspective, there are tight interactions between top-down and bottom-up processing in the dorsal frontoparietal system as well. Corbetta and Shulman (20002) report that the dorsal network is modulated not only by contextual, task-relevant information but also by the sensory distinctiveness of stimuli, i.e. by information that is typically associated with bottom-up, automatic, stimulus-driven control (Corbetta and Shulman, 2002).

    The existence of such interactions between top-down and bottom-up control of attention is supported by state-of-the art computational models of neural effects of attention as well. According to the so-called Normalisation Model of Attention (Reynolds and Heeger, 2009; Lee and Maunsell, 2009), neural output is jointly determined by three distinct factors: stimulus features, the operation of an ‘attentional field’—that exerts its effects by multiplying the excitatory input of the neurons in question,—and built-in input processing mechanisms that normalise neural responses via divisive suppression, i.e. by dividing the stimulus drive by the activity of a pool of surrounding neurons (see Herrmann et al. 2010, and Cutrone et al., 2014 for empirical support; Beuth & Hamker, 2015 for a mechanistic microcircuit model; and Wu, forthcoming for a philosophical summary). Crucially from our present perspective, within this model the ‘attentional field’ is jointly determined by long range frontal back-projections mediating top-down information about task relevance and bottom-up signals mediating the effects of attentional capture (Heeger, personal communication).

    The output of these neural computations (amplified and/or sharpened neural representations)—and thus ultimately the tight interaction between top-down, task-related, and bottom-up, sensory control mechanisms (see also Corbetta and Shulman, 2002)—generates the so-called saliency maps (Itti and Koch, 2000, 2001) that prioritise stimulus-representations on the basis of their overall salience, serve as the basis of serial selection for recognition and action, and thus play a crucial role in directing shifts of attention during visual search tasks.

    Central executive control and individuals’ guidance

    With all this at hand, consider Buehler’s argument for the empirical explication of individuals’s guidance in terms of central executive control. Buehler acknowledges that for central executive control to help explicate individuals’ guidance central executive control and individuals’ guidance must appropriately correlate.

    Given the distinction between central executive (i.e. endogenous) and exogenous control of shifts of attention, Buehler needs to establish that individuals’s guidance appropriately correlates with endogenous control and not with exogenous control. From this perspective, the tightly intertwined nature of endogenous and exogenous control (see previous section) poses a problem. How can individuals’ guidance correlate with one but not the other?

    Buehler argues that the crucial difference is that while goal-directed endogenous control drives attention towards the target, stimulus-feature based exogenous control can drive attention away from the target and thus interfere with goal directed search. This claim, however, does not respect the actual mechanism underlying shifts of attention. Recall that under realistic conditions in visual search tasks there are no simple cases of purely endogenously or exogenously driven shifts of attention. In the case of what is traditionally conceived of as an endogenous shift of attention, certain perceptual representations of specific stimulus-features throughout the visual field or at specific spatial locations (determined by actual task demand) are enhanced, which modulates the actual saliency map. Nevertheless, attention will be shifted to the stimulus with the highest saliency, no matter which stimuli were targeted during the task. So in cases where top-down attentional amplification is not able to push the targeted stimulus to the top of the priority list of the saliency map attentional shifts will wander from distractor to distractor until finally it lands on the target. Note that although such a case is often called an instance of goal-directed allocation of attention, strictly speaking attention is not directed towards the target—it is directed towards the stimulus with the highest salience, which might or might not be the target. Similarly, just as endogenous control does not shift attention ‘towards’ the target, exogenous control does not shift attention ‘away’ from the target. Put it simply, shift of attention is always captured by the stimulus with the highest salience.

    That is, if one thinks that such endogenously controlled shifts of attention are guided, then it is hard to see why exogenously controlled shifts would not count as guided. And vice versa: if one thinks that exogenously controlled shifts of attention are not guided, it is hard to see why endogenously controlled shifts would count as guided. From this perspective, the description in the previous paragraph suggests that neither exogenous nor endogenously controlled shifts of attention are guided in the relevant sense.

    To see why, consider how individuals’ guidance might be characterised at the pre-theoretical level. Buehler offers an analogy with a mountain guide. According to Buehler, a good guide “picks the peak as her expedition’s destination and orients her expedition toward the peak”, “knows the route to the peak”, and “stably steer toward the goal of the expedition” (Section 3.2, Paragraph 2). Compare this characterisation with the description of how shifts of attention are determined. We have already seen the major problem: shifts of attention, even endogenously controlled shifts of attention do not know the ‘route to’ the target, they do not ‘stably steer toward’ the target. If central executive control of shifts of attention is like a mountain guide, then it is a pretty bad one. Imagine a mountain guide who only enhances the chance of reaching the summit the expedition wants to reach, who checks if they have reached the right summit only after the expedition has gotten onto the top, and if not then moves the expedition on towards a next summit which might or might not be the one they want to reach. Shifts of attention are guided just as much as such an expedition is.

    The situation is not better even if instead of Buehler’s mountain guide analogy, we rely on Frankfurt’s (1978) original characterisation of individuals’ guidance. As Frankfurt puts it, a behaviour is guided in the relevant sense if “its course is subject to adjustments which compensate for the effects of forces which would otherwise interfere with the course of the behaviour”, such that these compensatory adjustments tend to “ensure that the behavior is accomplished” (Frankfurt, 1978, p. 160). However, as we have seen it, central executive control of shifts of attention in visual search is not able to exert such adjustments that could compensate for the effects of salient distractors, and it cannot ensure that the visual search gets accomplished. Therefore, endogenously controlled shifts of attention do not seem to be guided in the original sense of the term.


    In this brief comment I argued that endogenously controlled attention shifts in visual search are not forms of guided behaviour in the sense relevant for action theory, and thus central executive control of shifts of attention cannot realise individuals’ guidance. This means that Buehler’s present argument is not sound, since its first premise is false. However, it does not seriously jeopardise Buehler’s overall project. Buehler could still establish his general constitutive claim (“the control of a process by an individual’s central structures partly constitutes that individual’s guidance”) by showing that whenever a behaviour is guided it is under the control of some central structures.


    Beuth, F., and Hamker, F. H. (2015): A mechanistic cortical microcircuit of attention for amplification, normalization and suppression. Vision Research, 116: 241-257.
    Carrasco, M. (2011): Visual attention: The past 25 years. Vision Research, 51: 1484-1525.
    Chun, M, Golomb, J D, and Turk-Browne, N. (2011): A Taxonomy of External and Internal Attention, Annual Review of Psychology, 62: 73-101.
    Corbetta, M., Shulman, G. L. (2002): Control of goal-directed and stimulus-driven attention in the brain. Nature Review Neuroscience, 3: 201–215.
    Cutrone, E. K., Heeger, D. J., and Carrasco, M. (2014): Attention enhances contrast appearance via increased input baseline of neural responses. Journal of Vision, 14(16): 1-14.
    Diamond, A. (2013): Executive functions. Annual Review of Psychology, 64: 135-168.
    Frankfurt, H. G. (1978): The problem of action. American Philosophical Quarterly, 15: 157-162.
    Herrmann, K., Montaser-Kouhsari, L., Carrasco, M., and Heeger, D. J. (2010): When size matters: Attention affects performance by contrast or response gain. Nature Neuroscience, 13(12): 1554–1559.
    Itti, L., and Koch, C. (2000): A saliency-based search mechanism for overt and covert shifts of visual attention. Vision Research, 40: 1489-1506.
    Itti, L., and Koch, C. (2001): Computational modelling of visual attention. Nature Review Neuroscience, 2: 194-202.
    Jennings, C. D. (2012): The Subject of Attention. Synthese, 189 (3): 535-554.
    Lee, J., and Maunsell, J. H. R. (2009): A normalization model of attentional modulation of single unit responses. PLoS ONE, 4(2): e4651.
    Petersen, S. E., Posner, M. I. (2012): The attention system of the human brain: 20 years after. Annual Review of Neuroscience, 35: 73-89.
    Posner, M. I., Petersen, S. E. (1990): The attention system of the human brain. Annual Review of Neuroscience, 13: 25-42.
    Reynolds, J. H. and Heeger, D. J. (2009): The normalization model of attention. Neuron, 61(2): 168–185.
    Wu, W. (2014): Attention. New York: Routledge.
    Wu, W. (forthcoming): Shaking up the mind’s ground floor: the cognitive penetration of visual attention. Journal of Philosophy.

  3. I am very happy to comment on Denis Buehler’s interesting paper ‘Guidance of Visual Attention’. I will start with a brief summary of how I understand Denis’ goal and argumentative strategy, and then move on to comments and critical remarks.


    Denis’ paper aims to contribute to a solution of Harry Frankfurt’s problem of action, which – in Frankfurt’s own words – is to “explicate the contrast between what an agent does and what merely happens to him” (Frankfurt 1988 [1969], p. 69). Following a central idea in Frankfurt’s work, Denis starts from the assumption that the relevant contrast is between processes that are guided by the relevant individual and those that are not under the individual’s guidance (p. 1). After trying to show that some of the most prominent contemporary accounts of agency fail, since they either fail to be comprehensive enough to cover simple and non-human agency in animals, or fail to explain why guidance is attributable to the whole individual, Denis then aims to explicate the relevant concept of guidance by means of studying a central case, namely the individual’s guidance of visual attention. In this case, Denis argues, guidance is realized by the central executive control system. We can explicate the notion of guidance by appeal to central executive control, he argues, because (a) we can empirically show that such control correlates with an individual’s guidance (p. 17), and (b) appeal to such control deepens our understanding of guidance and thus helps to ground explanations involving the notion of guidance (p. 2, 17). On the basis of this case study, Denis finally conjectures that, generally, an individual’s guidance is at least partially constituted by “[t]he control of a process by an individual’s central structures” (p. 24).

    I really like what Denis takes as his case study. In the case of attention shifts the distinction between active and passive has been studied empirically in great detail, and so I share the hope that a close look at this case study may help to make progress with the problem of action. The case is also interesting for two other reasons. First, consider that visual attention shifts are clearly mental acts that are not “rehearsals of physical actions” (Carruthers 2009, p. 152) and do not involve any “visual and other forms of imagery” (ibid.). Anyone who, like Carruthers, still thought that there really aren’t mental acts thus is shown to be mistaken. Second, consider that if some stretches of attention are cases of agency that are under the individual’s guidance then it is hard to see how attention could be a necessary part of the explanation of individually guided agency since we would already need attention in order to get attention.

    Strategy and Overall Shape of the Project

    Let me now move to some comments on the general shape of the project. Denis says that he is interested in an explication of the concept of an individual’s guidance. While he clearly wants such explication of a concept to be different from conceptual analysis (p. 2), I was wondering why Denis talks about concepts at all. Why not just talk about providing an account of what constitutes an individual’s guidance, i.e. at least a partial account of (the nature of) guidance? Given that Denis clearly believes that empirical work can play a central role in his project, isn’t that project better captured by dropping talk of concepts (physics tells us about the nature of space not the concept of space, biology about the nature of tigers not the concept of a tiger, and the psychology of attention about the nature of guidance and not the concept of guidance)? While I don’t think it matters much for what follows, I would like to invite Denis to comment on why he frames his project in terms of conceptual explication.

    The Critique of Recent Action Theory

    Denis argues that many recent theories of agency violate the comprehensiveness constraint: they are focused on sophisticated human agency and stand no hope to explicate what distinguishes the active from the passive movements of a spider. I tend to agree with this complaint. But the complaint rests on the assumption that these accounts are trying to explicate the same phenomenon that Denis is trying to explicate. An opponent might here disagree on what the relevant explicandum is. She might think that the most significant kind in the relevant neighborhood (or maybe the most important concept) is the sophisticated, characteristically human kind. If the objector were able to show that this “sophisticated agency” has a powerful explanatory role, normative structure, and/or central place in our conceptual framework that is not shared by primitive agency, then – I believe – we cannot fault them for not accounting for primitive agency or primitive guidance. An opponent of this type might agree that there we need an account of this primitive agency, but this would not solve or even contribute much to solving the philosophical problem of action. In order to counter this objection, I believe, one would need to say a bit more about what makes primitive agency significant.

    Routine Action. Guidance or Control?

    I was wondering about how Denis understood the notion of guidance that he sought to explicate. Harry Frankfurt, correctly I believe, allowed for cases of agency where a process is under the individual’s guidance even though she is not currently controlling the process in any sense (p. 75). He uses the example of driving downhill. You might actively drive (and guide the driving) even at times when you do not intervene to adjust the movement that his happening though gravitational forces. In cases like this the individual’s guidance consists in her preparedness to intervene if necessary, and her being in a position to do so more or less effectively (ibid.). Many routine actions arguably are similar: walking toward the kitchen in the morning, effortless reading, or playing a well-rehearsed piece on an instrument are an individual’s acts that she doesn’t control. They are under her guidance in Frankfurt’s sense, since she could effectively intervene if necessary, but she does not currently control them. I wasn’t sure that Denis does and can allow for such cases of guidance. This issue comes up in two places.

    First, there is Denis’ critique of accounts of agency that fail to distinguish guidance from background conditions of guidance. Why couldn’t an intention based account, for example, conceive of intentions as types of states whose functional role is to keep the agent “on track” (and whose existence explains why she is able to adjust and interfere if necessary). Denis seems to want guidance to be more “online” than this. But I am not sure that one can do this while accounting for the driving type cases.

    Second, the issue of routine action comes up with regard to Denis own account. It is at least unclear that cases like this require the central executive or executive function. Executive function is normally thought to be required exactly when actions have not been routinized. When they are, they do not require executive resources. This problem for Denis’ approach can be put as a dilemma. If guidance by the individual requires her current causal control, then many forms of agency are not guided by the individual and thus the category of processes that are guided by the individual is not even approximately coextensive with the category of ‘what an agent does’. But if guidance is understood more broadly (as Frankfurt understands it), then it is highly unclear that the central executive system is necessary for guidance as in routine actions. We would thus like to hear more from Denis about the central executive as he thinks of it that makes room for its deployment also in routine action.

    What is the central executive system? Does it explain anything?

    Denis, importantly, notes that recent research shows that the distinction between active and passive attention shifts does not coincide with the distinction between endogenously and exogenously generated attention shifts. Past experience and reward, for example may initiate a passive attention shift. The passive attention shift is endogenously generated and controlled. Which endogenous attention shifts then are guided by the individual? Denis thinks that it’s those that are controlled by the central executive. I worry about how explanatory this suggestion is. The concept of the central executive system, as I know Denis is aware, was first introduced by Alan Baddeley in his highly influential work from the 1970s as an important component of working memory. In a recent review of work that was influenced by his research Baddeley notes that the invocation of the central executive has often been criticized for invoking a sort homunculus that can do all the things the subsystems cannot do. For this reason, Baddeley says, appeal to the central executive itself should not be understood “as providing an explanation, but rather as a marker of issues requiring explanation (Baddeley 2011, p. 14). As Denis notes there are a number of functions attributed to the central executive: task switching, working memory maintenance, and inhibition of distractors. Not all of these are involved in every active attention shift (or visual search). But what then is it supposed to mean that “the central executive” controls the attention shift? Denis says various things about that: on p. 19, for example, he says that “the central executive system functions to integrate and coordinate individuals’ states and competencies for the successful execution of individual-level processes”. What does the explanatory work, it seems to me, here isn’t a quasi-homuncular executive system, but the fact that the individual’s states and competencies are coordinated and integrated so as to allow the successful pursuit of her goal. The individual guides her activity, because it isn’t the case that one of her subsystems is controlling the act by itself, but rather she can bring to bear a wide variety of states and competencies in pursuit of her goal. My question for Denis then is: is guidance really explained by “control by the central executive” or rather by the integration and coordination of a wide variety of individual level states, processes, skills and competencies? On the alternative picture then there isn’t a “central structure” (p. 25) that coordinates the parts of the individual, but there is just the integration and coordination of the parts and states of the individual.

    Is there an a priori argument for Denis’ thesis?

    Finally, I would like to ask about the role of the empirical research in the explication of guidance. Step back for a moment, and consider complex objects generally. Which states and processes are fundamentally states and processes of the whole object, and which are fundamentally states and processes of its parts? Consider the molecular structure of some complex molecule. That structure clearly is a property of the whole complex object and not its parts because having that structure is a property that involves relations between the parts (no single part, or sub-system, has that structure). Generally, it seems very plausible that a property is fundamentally a property of a whole W with parts pp and not any of its parts p if having that property requires that those pp‘s to stand in specific (external) relations. But then it seems to follow more or less a priori that a process is guided by the individual if it is guided in a way that integrates and coordinates its various sub-systems (of course, there is a question of who “big” the relevant whole really is, since it is unlikely that all subsystems of the individual are integrated and coordinated). So, I would like to invite Denis to say more about what exactly we are supposed to gain from the empirical data he appeals to.


    Baddeley, A. (2012). Working Memory: Theories, Models, and Controversies, Annu. Rev. Psychol., 63, 1–29
    Carruthers, P. (2009). Action-Awareness and the Active Mind, Philosophical Papers, 38(2), 133-156
    Frankfurt, H. (1988 [1978]). The problem of action, Reprinted In: H. Frankfurt. The importance of what we care about. Philosophical essays (Ch. 6, p. 69-79). Cambridge, UK: Cambridge University Press

  4. I want to thank Peter, Mark, and Sebastian for taking the time to read, think about, and comment on my paper. Having people spend their time on one’s own work is a humbling experience, especially for someone early in their career. I am deeply grateful for it. I’ve learned a lot from working through the three sets of comments. And I hope I am doing justice to each of them.


    Reply to Fazekas: Captured attention

    I really enjoyed the extent to which Peter delved into psychological research relevant to my paper. Peter thinks that I misuse or misunderstand our psychological evidence concerning visual attention shifts. Naturally, I disagree. Let me try to explain why.

    A loud bang, or bright light flashed at you, will capture your attention. There is some reason to think that you may be able to voluntarily suppress such a shift, if you really try, or if you focus your attention on some other thing. But, typically, such a sudden, abrupt, physically salient stimulus will capture your attention. In the paper, I identify this type of episode as capture of attention. I claim that such shifts of attention are intuitively passive, non-guided episodes. Psychological processes underlying this type of episode hence do not appropriately correlate with individuals’ guided shifts of attention.

    In the paper, I report that psychologists explain such captured attention shifts in terms of the exogenous system’s activity alone. I write: “the exogenous system exclusively responds to stimuli that are physically salient (p. 10).” Peter mistakenly thinks that I claim that the exogenous system does never respond to stimuli other than physically salient ones. This misunderstanding is my fault. I should have written “the exogenous system alone exclusively …” If I were to make the claim Peter takes me to be making, I would be in contradiction with what I say in the next paragraph, where I address ways of interaction between the exogenous and the endogenous system, and mention research to the effect that endogenous goals can “tune the exogenous system to be more sensitive” to certain stimuli.


    Does Peter want to claim that physically salient stimuli alone never determine where attention shifts? That claim seems antecedently implausible – remember the above examples of captured attention. The claim is also not supported by psychology.[i] Even the authors that Peter cites seem to acknowledge such captured attention shifts.[ii]  Recent computational models do not support the claim.[iii] Capture as in the example above will not be mediated by your current expectations or goals. The behavioral relevance of capturing stimuli lies in the evolutionary advantage that comes with keeping track of abrupt changes in your environment. We must distinguish such cases from the influence that current (endogenous) goals have on attention-shifts when they, for instance, guide attention toward a tool relevant for an ongoing bodily action.[iv] Peter is right to point out that most of the time, attention shifts are influenced by our occurrent goals or other endogenous factors. And indeed, psychologists sometimes have called some of these types of shifts instances of ‘captured’ attention. But I find this nomenclature misleading – after all, the individuals’ goals drive these attention shifts. In the paper I focus on what I take to be clear cases of capture.[v]


    Peter writes:

    “Given the distinction between central executive (i.e. endogenous) [my emphasis] and exogenous control of shifts of attention, Buehler needs to establish that individuals’ guidance appropriately correlates with endogenous control and not with exogenous control. From this perspective, the tightly intertwined nature of endogenous and exogenous control … poses a problem. How can individuals’ guidance correlate with one but not the other?”

    The passage suggests that Peter takes himself to have established that endogenous and exogenous activity are always ‘entwined.’ (Or else there would be no in-principle problem with the approach Peter worries about.) I have explained why I think that Peter has not established this.

    As I acknowledged earlier, Peter is certainly right to think that endogenous and exogenous factors interact, most of the time, so as to generate shifts of attention. In section 3.1 I explicitly acknowledge the interaction between exogenous and different types of endogenous factors. But in the quote above, Peter mistakenly suggests that I want to explicate guidance as control by endogenous factors, and in contrast with exogenous factors. The point of section 3.1 is rather to distinguish which endogenous factor plausibly correlates with individuals’ guidance, and which endogenous factors do not.[vi]


    So which factor does correlate with individuals’ guidance? Peter claims that

    “Buehler argues that the crucial difference is that while goal-directed endogenous control drives attention towards the target, stimulus-feature based exogenous control can drive attention away from the target and thus interfere with goal directed search,” and that my answer to the question “does not respect the actual mechanism underlying shifts of attention.”

    This portrayal is not quite accurate. I believe that, first, there is a sense in which the relevant endogenous factor drives search toward the search goal, but, second, both endogenous and exogenous factors can contribute to driving attention away from the actual target.

    Computational models and behavioral studies of visual search suggest this more complex picture. During visual search, a representation of the search goal held in memory determines the computations that lead to assignments on the priority map. Our best models of shifts during search assume that attention moves to the location on the map that has the highest priority value. This value is determined, partly, by the similarity between the represented goal and stimuli at specific locations. So these models offer a concrete sense in which the search goal causally controls the computations of assignments on the priority map (or “drives” attention toward the goal).

    Other factors, too, figure into the determination of the priority value for any given location on the map. These factors include the physical saliency of a location, information about types of scenes, or priming for specific features. Insofar as these influences derive from factors other than the search goal, they draw attention away from its goal.[vii] But even when attention lands in the wrong location, we can say that the processes are ‘guided toward some search goal.’[viii]

    How can we distinguish the search goal’s influence on the priority map from that of other endogenous factors, such as other memories in working memory? Behavioral studies suggest that the difference is due to the allocation of central resources by a central executive system. My proposal is that, in spite of the constant interaction between the different factors, appeal to psychology helps us identify one endogenous factor – central executive control – that does correlate with individuals’ guidance.


    Finally, how can the executive system’s control of attention shifts help us illuminate marks of the concept guidance, such as ‘knowing the route to the goal’ or ‘stably steering toward the goal’? Illumination comes from appeal to the more sharply characterized scientific notions. Naturally, we cannot expect the system to do its job more or less without fail. After all, the system does not know where the search target is located. Still, it has a template of its search target and uses surprisingly sophisticated and successful computations to search for that goal as efficiently as possible.[ix] And the system won’t let off until it has actually found its target. The system’s situation, in visual search, is more like that of a mountain guide who merely owns a drawing of the peak’s contours, and is supposed to navigate the expedition toward that peak in a foggy, cluttered environment. Given these conditions, success rates like the attentional system’s are impressive. I’d follow that mountain guide on her expedition. Well, if I absolutely had to go on the expedition, at any rate.


    Reply to Fortney: Explanation in terms of central executive control

    Mark worries that I want to

    “use the concept of the central executive to illuminate the concept of guidance, [and hence that I] … require that we understand the concept of the central executive better than we understand the concept of guidance. But [Mark is] not sure that we do.”

    Therefore, I presume, appeal to central executive control does not illuminate guidance. The worry is natural. Philosophers and psychologists alike have criticized research on executive functions as not explanatory. I believe that these criticisms are now out-dated. Unfortunately, I did not have space, in the paper, to expand on my views about research on executive functions. That is why I am particularly happy that Mark voices his concerns about the issue.[x]


    Baddeley writes of the central executive system (cf. Mark’s quote):

    “Within the original model [the central executive] was assumed to be capable of attentional focus, storage, and decision making, virtually a homunculus.”

    In Baddeley’s original model the central executive was homuncular, because the very abilities that psychologists wanted to explain were attributed to the system. Psychologists wanted to explain psychological competencies of individuals, such as decision-making. Ideally they wanted a neuro-computational mechanism that would explain such processes. Attributing an ability to decide to a system, of course, does not explain individuals’ ability to decide. And it is far from a fully mechanistic explanation of decision-making. A series of psychologists and philosophers, accordingly, considered the central executive system to be merely a placeholder.[xi]


    Notice the following passage in Mark’s second quote from Baddeley, however:

    “Provided the various jobs performed by the homunculus are identified, they can be tackled one at a time, hopefully in due course allowing the homunculus to be pensioned off.”

    Psychological explanation typically advances from identifying an explanandum through a series of stages of ‘breaking-down’ the explanandum into components. Each movement from a higher-level to a lower-level stage constitutes explanatory progress. A psychological process such as object-vision is identified as plausibly involving the sub-components of image-based and surface-based processing. Next, psychologists tackle the problem of explaining each sub-component.

    The psychological investigation of executive functions has seen progress along these lines, in the last 20 years. I believe that especially Miyake et al.’s introduction of confirmatory factor analysis into this field was an important step.[xii] It allowed developing and testing hypotheses about the different sub-components that characterize the central executive system. In this way, Miyake et al. identified three plausibly basic executive functions: switching, inhibition, and maintenance. This work allowed breaking these sub-components down into further sub-components.[xiii] The process of breaking down functions allowed more precise behavioural study of the components, developing computational models of the components, and identifying neural structures that may implement the different components.[xiv] It allowed investigating the role of executive functions in more complex cognitive tasks, and their relations to such issues as general intelligence, ageing, anxiety, and others.[xv] Many recent developments in the field constitute genuine explanatory progress.


    How, then, do appeals to central executive control deepen our understanding of individuals’ guidance? The characterization of the system may be preliminary and will certainly develop with future research. Appeals to the system nevertheless deepen understanding of guidance in at least two ways. A first way in which such appeals deepen understanding is by connecting our pre-theoretic notion with notions that psychologists investigate. Such a connection can help sharpen the pre-theoretic notion. There can be no doubt that current psychological research provides characterizations of executive functions that are sharper than our pre-theoretic notion guidance. A second way is that, thereby, we can see that the knowledge psychologists have acquired or will acquire about the system – about computational models of executive functions, behavioural profiles, their connection with general intelligence and so forth – in principle bears on and can help illuminate our pre-theoretic notion. That knowledge was not part of our pre-theoretic understanding of guidance.


    I think that Mark is right to caution against identifying the central executive system “just by looking at what is said by psychological explanations that deploy the term “[central executive control]”.” More generally, Mark seems right when he seconds Wayne Wu’s recommendation that we should

    “look at [psychologists’] explanations, but we should also keep in mind how well they cohere with the best account of what many psychologists really mean when they say “[central executive control]”.

    That certainly is what I take myself to be doing.[xvi] I hope that impressions to the contrary were mainly due to the all-too condensed presentation of my views on the central executive system in the paper.


    To conclude, I do believe that we have a fairly good grip on certain aspects of the central executive system. I do believe that there is a conception of that system that probably most psychologists working in the field could agree on. Progress in the field, as in many parts of psychology, has been impressive. It is time for philosophers to engage with this research and to use it to address philosophical issues. One way of doing so, I continue to believe, is explicating guidance by appeal to this research.


    Reply to Watzl: Primitive action, reflex action, and appeal to empirical research

    There is much that Sebastian and I agree upon, I think. I agree, for example, that attention shifts are mental acts; I agree that much of the discussion could have been framed in terms of the nature of guidance; I also agree with Sebastian’s observation about attention’s not being a necessary element in agency. Some of his questions are on topics I did not have opportunity to talk about in the paper and that I would like to address more fully in future work. Let me try to answer some questions that come up in Sebastian’s comments.


    Is action theory talking about a different, more sophisticated phenomenon, than what I am interested in? I do not think so. I believe that the core of our concept agency is that an individual does something, is the source of a process. Less fundamentally, an acting individual aims at something. Both phenomena can be encountered across the animal kingdom. An account of agency cannot ignore this fact.

    I partly agree with the stance Sebastian proposes. We can get the most out of many extant accounts if we use them to understand more sophisticated types of action. The accounts have difficulties there, too. Some of them I mention in the paper. Even humans plausibly exercise more primitive agential capacities than some of the accounts acknowledge. Further, in order to fully explain, for instance, perception-guided action, we must address the attributability-constraint. Extant accounts do not do that. So they do not even fully explain the phenomena that are clearly supposed to be in their purview.

    But I also think that the focus on sophisticated human action betrays a more general tendency that we should counteract. A too exclusive focus on humans leads to false theorizing even about some of our own abilities. Such a focus skews our view of our own place in the natural world. Understanding our capacities as continuous with those of other animals may not merely help us avoid making such mistakes. It may open new avenues for the scientific and philosophical study of these capacities. Primitive action is significant or of interest because to fully understand agency, I think, we need to understand it in all its breadth.


    What is the role of the central executive system in routine and reflex action? This is an excellent question, and my answer to it may be somewhat disappointing. In much routine action, I believe, the central executive system plays a background-organizing role. Take the case where you walk up and down your room while thinking about your paper. The central executive system activates or triggers your walking as much as your thinking, coordinates the two activities while you’re executing them, and allocates resources to them.[xvii]

    How about reflex actions? Think of the boxer’s weaving a jab, or your pulling away your hand from the stove; distinguish these cases from mere reflexes, such as a knee jerk. It seems plausible that the former episodes are acts. Do they involve executive control? I can see several ways of accommodating such actions in the present framework:

    1. They may involve minimal executive control, similar to routine actions.
    2. They may be directly available to central executive control (as in Frankfurt’s driving-example) while not concurrently requiring exercise of such control for their execution.
    3. Acquiring the ability to react in any of these ways may require executive control, while subsequent performance can occur quasi-reflexively.
    4. They may constitute a class of non-guided reflex acts for which my picture cannot account.

    Naturally I have a preference for options 1-3, since each of these options would allow me to explicate (mammals’) agency in terms of a process’s relation to executive control. Option 4 would make such an explication more difficult. On this option, I might attempt to argue that some reflex-‘acts’ are not really acts. For others it may be plausible to argue that they are not central to our notion of action. I then need to give an explanation as to why these are acts nevertheless. In the end, a full answer to Sebastian’s question is, at least in part, an empirical matter. I am currently working toward such an answer. For the time being I suspend judgment as to which option is the correct one.


    What does the central executive system explain? I say more about the status of research on central executive control in my reply to Mark. If our aim is the psychologist’s, namely, to explain, for instance, how an individual can solve the Tower of Hanoi test, then appeals to central executive control may not explain much. We do not understand well the mathematical and other skills involved in solving the test. We do not understand well how the central executive system helps us to successfully complete the task. We are far from having full computational or neural models, or a full mechanistic explanation of the processes underlying these tasks. It is in this sense, I think, that Baddeley considers the central executive system a “placeholder.”

    My claims about explanation are much more modest. There is empirical reason to think that there is a central executive system that helps coordinate and integrate individual’s capacities. There is empirical reason to think that the system does this executing a set of executive functions. By making connecting executive control with guidance we can both sharpen the notion and make present and future psychological research bear on our understanding of the notion.

    Sebastian is right to point out that not all executive functions are involved in every active attention shift. But some of them are (or so I claim). For instance, a spontaneous active shift may involve switching from one act to the next. The characterization of the system as an integrator and coordinator is rather of relevance in explaining why this involvement of the system explicates the individual’s guidance. In this sense, it is appeal to this system’s activity that helps us explicate guidance, not appeal to the individual’s activities.


    Can we establish my claims about guidance in an a priori manner? I like Sebastian’s idea and was initially very attracted to it myself. But I don’t believe that my specific proposal can be had quite that easily. In some sense, your metabolism is a structure that integrates and coordinates activity in different parts of yours. But your metabolism doesn’t yield the right kind of whole individual. Since I cannot think of an a priori way of spelling out what the appropriate structure must be, I attempt to identify it in an a posteriori manner. I’d be quite happy if I turned out to be wrong on this point.



    [i]           Yantis, S. & Jonides, J. 1990. “Abrupt Visual Onsets and Selective Attention: Voluntary versus Automatic Allocation.” Journal of Experimental Psychology: Human Perception and Performance 16: 121-134; Theeuwes, J. 1991. “Exogenous and Endogenous Control of Attention: The Effect of Visual Onsets and Offsets.” Perception & Psychophysics 49(1): 83-90; Yantis, S. 1993. “Stimulus-Driven Attentional Capture and Attentional Control Settings.” Journal of Experimental Psychology: Human Perception and Performance 19(3): 676-681; Here are some more recent sources: Wright, R. & Ward, L. 2008. Orienting of Attention. Oxford: OUP; Carrasco, M. 2011. “Visual Attention: The Past 25 Years.” Vision Research (51): 1484-1525; Ruz, M. & Lupianez, J. 2002. “A Review of Attentional Capture: On its Automaticity and Sensitivity to Endogenous Control.” Psicologica 23: 283-309; Forster, S. & Lavie, N. 2008. “Attentional capture by entirely irrelevant distractors.” Visual Cognition 16; Kawahara, J., Yanase, K. & Kitazaki, M. 2012. “Attentional capture by the onset and offset of motion signals outside the spatal focus of attention.” Journal of Vision 12: 1-13

    [ii]           Cf. p. 208 of Corbetta, M. & Shulman, G. 2002. “Control of Goal-Directed and Stimulus-Driven Attention in the Brain.” Nature Reviews Neuroscience 3: 201-215

    [iii]          The best models concern free viewing or tasks in normal scenes that do not contain such extreme stimuli as the ones mentioned earlier. Cf. Geisler, W. & Cormack, L. 2011. “Models of Overt Attention.” In S. Liversedge, I. Gilchrist & S. Everling (eds.), The Oxford Handbook of Eye Movements (pp. 439 – 454), New York: OUP; cf. also Itti, L. & Koch, C. 2000. “A Saliency-Based Search Mechanism for Overt and Covert Shifts of Visual Attention.” Vision Research 40: 1489-1506; Itti, L. & Borij, A. 2014. “Computational Models: Bottom-Up and Top-Down Aspects.” In A. Nobre & S. Kastner (eds.), The Oxford Handbook of Attention (pp. 1122-1158), Oxford: OUP.

    The Reynolds model, as I have understood it, is a model of the attentional modulation of the activity of individual neurons. It is not a model of attention shifts. Why does Peter think that it bears on the issue at hand?

    [iv]          Hayhoe, M. & Ballard, D. 2005. “Eye Movements and Natural Behavior.” Trends in Cognitive Science 9: 188-194; Ballard, D. & Hayhoe, M. 2009. “Modeling the Role of Task in the Control of Gaze.” Visual Cognition 17: 118501204; Land, M. 2009. “Vision, Eye Movements, and Natural Behavior.” Visual Neuroscience 26:51-62

    [v]           Indeed, there was a fairly extensive debate about how exactly to characterize attention ‘capture’. Some psychologists proposed to characterize it in terms of search-slopes, others in terms of automaticity. Neither characterization seems to clearly rule out that individuals might guide the relevant processes. I side-step this debate in my paper. For a fuller discussion of capture and the issues in this and the last paragraph, cf. Buehler, D. 2014. Psychological Agency – Guidance of Visual Attention, UCLA Dissertation, Chapter 3.

    [vi]          Note that even if it turned out that all attention shifts are influenced by endogenous factors, the proposal I make as to which endogenous factor correlates with guidance might still turn out to be true.

    [vii]          Najemnik, J. & Geisler, W. 2009. “A Simple Summation Rule for Optimal Fixation Selection in Visual Search.” Vision Research 49: 1286-1294; Zelinsky, G. 2008. “A Theory of Eye Movements during Target Acquisition.” Psychological Review 115(4): 787-835; Pomplun, M., Reingold, E. & Shen, J. 2003. “Area Activation: A Computational Model of Saccadic Selectivity in Visual Search.” Cognitive Science 27: 299-312; Eckstein, M. 2011. “Visual Search: A Retrospective.” Journal of Vision 11(4): 1-36

    [viii]         A fuller discussion of recent models of visual search and their neural underpinnings can be found in chapters 3 & 4 of my 2014 Psychological Agency – Guidance of Visual Attention, UCLA Dissertation.

    [ix]          Cf. the literature in Fn. 7

    [x]           Mark’s worry partly overlaps with one of Sebastian’s, I believe. For a fuller treatment, cf. chapter 2 of my Buehler, D. 2014. Psychological Agency – Guidance of Visual Attention, UCLA Dissertation

    [xi]          I believe that philosophers have tended to ignore or downplay research on the system due to a set of influential criticisms of this research by Allan Allport and Daniel Dennett in the 90s. (cf. Allport, A. 1993. “Attention and Control: Have We Been Asking the Wrong Questions? A Critical Review of Twenty-Five Years.” In D. E. Meyers & S. M. Kornblum (eds.), Attention and Performance XIV: Synergies in Experimental Psychology, Artificial Intelligence, and Cognitive Neuroscience (pp. 183-216). Cambridge, MA: MIT Press; Dennett, D. 1994. Consciousness Explained. Boston: Little, Brown & Co; Monsell, S. & Driver, J. 2000. “Banishing the Control Homunculus.” In S. Monsell & J. Driver (eds.), Control of Cognitive Processes: Attention and Performance XVIII (pp. 3-32), Cambridge, MA: MIT Press). Current rejections of this research are typically not based on a serious engagement with recent results in the field.

    [xii]          Miyake, A., Friedman, N. P., Emerson, M. J., Witzki, A. H., Howerter, A., & Wager, T. D. (2000). “The unity and diversity of executive functions and their contributions to complex “frontal lobe” tasks: A latent variable analysis.” Cognitive Psychology, 41: 49–100

    [xiii]         Friedman, N. & Miyake, A. 2004. “The Relations among Inhibition and Interference Control Functions: A Latent-Variable Analysis.” Journal of Experimental Psychology: General 133(1): 101- 135

    [xiv]         Gazzaley, A. & D’Esposito, M. 2006. “Unifying the Prefrontal Cortex: Executive Control, Neural Networks, and Top-Down Modulation.” In B. Miller & J. Cummings (eds.), The Human Frontal Lobes. Functions and Disorders. (pp. 187-207) New York: Guilford Press; Munakata, Y., Herd, S., Chatham, C., Depue, B., Banich, M. & O’Reilly, R. 2011. “A Unified Framework for Inhibitory Control.” Trends in Cognitive Sciences 15(10): 453-459

    [xv]          Anderson, V., Jacobs, R. & Anderson, P. (eds.) 2008. Executive Functions and the Frontal Lobes. A Lifespan Perspective. New York: Taylor & Francis; Conway, A., Jarrold, C., Kane, M., Miyake, A., & Towse, J. (eds.) 2007. Variation in Working Memory. Oxford: OUP; Friedman, N., Miyake, A., Young, S., DeFried., J. Corley., R. & Hewitt, J. 2008. “Individual Differences in Executive Functions are almost entirely Genetic in Origin.” Journal of Experimental Psychology: General 137(2): 201-225; Munakata, Y., Snyder, H. & Chatham, C. 2012. “Developing Cognitive Control: Three Key Transitions.” Current Directions in the Psychological Sciences 21(2): 71-77; Kurzban, R., Duckworth, A., Kable, J. & Myers, J. 2013. “An Opportunity Cost Model of Subjective Effort and Task Performance.” Behavioral and Brain Sciences 36(6): 661-726; Zaitchik, D., Iqbal, Y. & Carey, S. 2014. “The Effect of Executive Function on Biological Reasoning in Young Children: An Individual Differences Study.” Child Development 85(1): 160-175

    [xvi]         Cf. fn. 11.

    [xvii]         Baddeley and others thought of the central executive system activity as necessarily involving attention, even consciousness. This is not the notion I am interested in. I am more interested in the central executive system qua coordinator and integrator that also has limited resources available for its work.

    1. Thanks for your reply, Denis! I wanted to follow up a bit. You say that the claim that appeals to the central executive are not explanatory is an outdated one. But it seems to me that on Baddeley’s way of thinking, it’s just a central part of the concept of the central executive that appeals to it won’t be explanatory. So it’s hard to see what sort of, say, empirical discovery would outdate the claim. Instead, something rather central about the concept itself would have to change.

      About breaking down the central executive into components and then explaining those components: this does seem like progress. But – as I understand Baddeley’s way of thinking – the thought is that once psychologists have identified and explained all those components, then they should stop making use of the concept of the central executive, and instead use the concepts of those better understood component parts.

      Is your thought that he is wrong about this, and that there is some sort of way these components are unified that would justify continued use of the concept of the central executive?

      1. Maybe we think differently about how concepts such as that of the central executive system work. You seem to want to appeal to something like ‘Baddeley’s definition of the concept’ when you say that empirical discovery couldn’t show that the concept is explanatory after all. Do I get that right? (I’d be interested to hear more about your views on this issue!) I am not sure that this is how concepts in psychological explanations work. Here is how I tend to think about the relevant concept. Baddeley introduced a notion into the debate (actually, it may be due to Broadbent, in this case). In doing so, he ‘got on to’ a psychological system – a set of executive functions for the control of cognitive processes. Both the referent of the notion and the referent’s characteristics have been precisified in subsequent research.

        As I pointed out, there has already been progress identifying sub-components of the system. I mentioned the executive functions of switching, maintenance, and inhibition. Doesn’t that mean that, already, we can make do with talk about the central executive system and instead confine ourselves to talk about these three functions? In some cases, we may explain all that we want to explain by doing so. In other cases, grouping the components together may be illuminating. There is currently no reason to think that executive functions, as we now think of them, will not exhibit sufficient unity for us to group them together and refer to them as belonging to the same kind. But is there any positive reason for treating them in this way? One reason is that they all function to control the execution of cognitive processes. Another reason is that there is behavioral evidence for unity among the executive functions. Apparently, the execution of different executive functions taps into one and the same underlying resource. Finally, there is reason to think that one prominent neural network – involving prefrontal cortex – implements executive functions in mammals.

        This is not to say that we might not turn out to be wrong about this. Elements of our theories of the executive system may well turn out to be mistaken. Maybe future science gives us reason to think that there really is no such thing as a set of executive functions. Maybe it will turn out that the apparent unity that we seem to see now was an illusion. My point is that there currently seems to be no reason to be worried about this possibility.

        1. Thanks, that’s helpful! To continue a bit – you say:

          “Here is how I tend to think about the relevant concept. Baddeley introduced a notion into the debate (actually, it may be due to Broadbent, in this case). In doing so, he ‘got on to’ a psychological system … ” .

          What I find surprising about this is that in your view, with Baddeley’s use of ‘the central executive’, he got on to a single psychological system despite his explicit intention not to do so. He seemed to want to use ‘the central executive’ to refer to a gerrymandered set of things. I don’t think that your interpretation of what’s happened is necessarily incorrect. But I do think that we should want an account of how and why the referential intentions of Baddeley and other psychologists have failed in this surprising way.

          I know that you talk more about these issues in your second chapter, so I will take a look – it’s very interesting!

          1. Hey Mark, what do you base your diagnosis of Baddeley’s referential intentions on? I am not so sure, if you look at earlier work, that he had any intention NOT to discover a unified system or set of functions. He just seems to be ok with that possibly turning out to be so. Also, I appeal to research by Miyake and others. So if anything, their referential intentions should matter.

            But again, the picture of scientific concepts that you at least seem to suggest strikes me as not particularly compelling. I do think that what such concepts mean or are about is not determined by the intentions of the guy who first used the word that expresses the concept, but by the kind that the concept gets on to (and to some extent, its use in the scientific community). What do you think about this?

  5. This is a comment on Peter Fazekas’ comment. Peter, you imply that no attention shift is a form of guided behavior in the relevant sense, or that – as you put it – “shift of attention is always captured by the stimulus with the highest salience.” The evidence you cite simply does not establish this, or even make it plausible. What the evidence suggests is that attention shifts are controlled by a priority map that always gets input both from bottom up as well as from top down. But that simply doesn’t undermine that some attention shifts are guided by the subject or suggest that attention is always captured.

    (What it probably does suggest is that the line between behavior that is individually guided and behavior that isn’t is either quite blurry or maybe even a matter of degree – of which I favor the latter).

    Consider a standard display: two identical Gabor patches, one left, the other right (nothing else on the computer screen). Suppose that there is a pointer that runs around a clock (one turn per minute) – like in the Libet experiments – though you don’t really need the pointer. The subject freely decides when to shift attention either left or right. There is no evidence that subject’s cannot do this (a lot of evidence that they can). Of course, like in all cases there are lots of factors (prior reward and experience among them) that will influence which way subject’s will go (that’s what the evidence shows, and – indeed – that’s what determinism implies), but that does not show that the shift wasn’t guided by the subject, or that this is a case of (passive) attention capture. I can’t see how one could make a claim like Peter’s about a case like this, without also claiming that the subject in a Libet experiment isn’t actively pressing the button (i.e. that pressing the button isn’t an actively guided form of behavior). The interplay between bottom up and top down factors (both of which are not a unified class), will be present in the control of bodily behavior just like in the case of attention. Indeed, just take saccading as your bodily behavior and we know all the relevant evidence. I take it as a reductio of the Peter’s claim that no attention shift is a form of guided behavior that on his view no saccade is a form of guided behavior either. So, Denis main example still stands.

  6. Thanks for this great response, Denis. Very helpful. With respect to the routine actions: I think we should accept that as actions become more routinized they require less and less central executive control (also no background organizing role). Think of running on a path you’ve run a thousand times before, or playing a piece of music you’ve played a thousand times before. While doing those things your mind might be free to do lots of other things. For those routine actions (like for what you call reflex actions) we need one of your other three options (2-4). I think 3 + 4 are giving up too much, so I’d like to know a bit more about 2. I worry that “easily available for executive control” isn’t so well defined. Think of the running case. I am not sure it is easy to apply executive control to running. Overlearned activities are often hard to change or control. So, I’d like to hear a bit more hear.

    1. As I said, I am a little less sure which of 1-3 will turn out to be true. You seem to be more committal here. Is that right? What is the commitment based on? I am wary of equating central executive control with attention or consciousness. And I am wary of trusting my intuitions about cases such as your running example. It seems to require empirical research to decide whether the central executive does not after all play some background organizing role, even in such episodes. Or do you think I am overlooking something, here? Really, what I am curious to learn more about is, what kind of evidence do you expect will decide the matter?

      I agree that the remarks about ‘easy availability’ are – as you put it so diplomatically – less than well-defined. The way I was thinking was very much along the lines you suggest in your first set of comments. There are old models (such as the Shallice 1988 model) of an executive system that is not involved in routine action, but will get involved as soon as a routine action encounters an obstacle or unexpected circumstances. Presumably, if there were some truth to these models, we should expect fairly close neural connections between networks responsible for routine action and executive control networks. Presumably, ‘availability’ would entail fast access of executive control to routine action, when needed. Appeal to these models does, of course, not do much to sharpen the idea of availability. But they may provide a start. Does that help a bit? Any idea how one might do better?

    2. Thanks to Denis for an extremely interesting paper, and to the commenters for their in-depth discussion.

      The topic of ‘routinization’, or automatization, of actions touches on issues of skill-acquisition, and this is an interesting point of contact between this discussion and others in the session.

      I’d like to suggest that something like Denis’ options 1 or 2 holds promise. It seems that routinized actions, however overpracticed, still must be available for some executive guidance, insofar as the well-trained guide is able to manipulate higher-order properties of the performance, or to avert novel and unexpected performance errors.

      In Sebastian’s example of playing a well-known piece of music, the guide retains the ability to ‘do things’ to the performance (e.g. play the piece faster or slower, alter its dynamics by highlighting different notes, etc.), and the more routinized the piece, the more available these modifications should be. And the guide is also able to recognize when something is going wrong: in Sebastian’s other example, the running path may have some new puddles from last night’s rain, or garbage that someone left there, or may be used by other people, and well-trained agents should be able to avoid these obstacles.

      For both manipulating higher-order properties and averting obstacles to performance, some sort of executive monitoring seems to be necessary. This monitoring may work by (1) executive control retaining some ongoing minimal involvement in performance, or by (2) the performance being directly available to executive control. But at any rate, even in the most routinized cases, the agent seems to retain some level of improvisational and corrective ability, which seems rather hard to explain without appeal to some executive function.

      (Now, I don’t know whether we could say the same thing about what Denis calls “reflex actions”. Perhaps the strategy there should be to deny their status as ‘actions’, and just call them reactions… Pulling your hand away from a hot stove seems explainable via merely exogenous mechanisms.)

      1. Thanks Pablo for your remark! Do you know of research that explicitly addresses executive involvement in these cases? I do not think of executive functions as involving the individual’s reflection. Do you?

        1. Hi Denis!

          First, I just want to reiterate that it’s been a delight reading your paper and following the discussion.

          On the latter question: I defined ‘reflection’ as any process involving the executive functions, so… yes, but by definition. (In this I’m mainly following J. St. B. Evans (2010), but this session has made me question whether it’s worth it, just because it generates a lot of confusion.)

          Concerning empirical research explicitly addressing executive involvement in these cases, the first thing that comes to mind is the evidence for parsing that Ellen discusses in her paper.

          On a more conceptual point, Ellen just made the very interesting comment (in the thread on her paper) that, even if online executive control isn’t involved in a given skilled performance (say, because it’s quite familiar), the motor representations involved do retain a structure that was developed in previous instances of executive control. I found this to be an interesting idea, that may help illuminate how executive guidance (as opposed to control) is involved in habitual actions: even if not actively involved, executive control can get involved given that the motor representation has this intelligent structure, that parses it into multiple bits.

          Evans, J. S. B. (2010). “Intuition and reasoning: A dual-process perspective.” Psychological Inquiry 21:4, 313–326.

  7. Thanks for the response Denis, and thanks for the further comment Sebastian.
    I am afraid my point has been misinterpreted, which, of course, is totally my mistake. So let me clarify.

    First, I have never intended to imply that “no attention shift is a form of guided behavior” (quote from Sebastian). I admit, the sentence “shift of attention is always captured by the stimulus with the highest salience” that Sebastian quotes is easy to understand in this way, but it was intended to be in line with the preceding paragraph in the original comment, in which I describe what we know about the mechanism behind shifts of attention. We all agree that exogenous factors, executive control, and other endogenous factors together determine salience values, after which attention is shifted to the stimulus whose representation has the highest salience on the salience map (this is the sense in which ‘salience’ in the sentence quoted by Sebastian was intended to be read, not in the sense of physical salience). In accordance with this, the unfortunate use of ‘capture’ was not a reference to bottom-up captured attention, but to the fact that once the salience map is computed, attention will be shifted to the stimulus with the highest salience value (on the map) automatically, regardless whether it is a stimulus targeted by the executive system.

    Second, I definitely do not need the strong claims that Denis and Sebastian read into my comment for my argument to work. My concern is the general correlation claim Denis makes: “central executive control correlates with primates’ guidance of attention shifts”. Denis’ own example is visual search. I argue that typically (or as I put it in the original comment “under realistic conditions”) in visual search, although the central executive system does play the role Denis attributes to it, shifts of attention do not seem to be guided. But if there are cases in which central executive control is in operation, but are nevertheless not instances of guided behavior, then it is not true that central executive control correlates with guidance, and it is not clear in what sense guidance could be explicated in terms of central executive control.

    So what is my problem with visual search? On the one hand, Denis, Sebastian and I are in agreement on the role of the central executive system. When attention lands on a non-target, the executive system initiates a reallocation, and modulates the computations that output the values of the salience map. So exactly the right kind of central executive control, which is supposed to help us understand guidance, is in play here. However, on the other hand, it only plays a role in computing salience values. The shifts of attention themselves are driven by the salience map itself: attention will be shifted to the stimulus with the highest salience value no matter which stimuli were targeted during the task. Denis thinks that it is enough to think of shifts of attention as being driven toward the goal. I disagree. Consider the set-size effect, i.e. the phenomenon that in visual search response time increases as the number of distractors (non-targets) increases. With a lot of distractors present attention lands on non-targets several times before it finally lands on the target. Is this series of non-target hits such that the non-targets consecutively attended can be said to be closer and closer to the target according to any metrics based on any feature space? I am not aware of any results supporting this claim. Or think about playing ‘Where is Waldo?’. The series of misses before spotting Waldo doesn’t feel like gradual progress, or being driven toward finding him. It feels more like trial and error. Shifts of attention in such (typical) cases of visual search do not seem to be driven toward the target.

    Lacking any clear definition of guidance Denis asks us to rely on our intuitions. I do not share Denis’ intuition that a tour guide offering such a trip is someone who I should follow. And if instead of our intuitions we try to rely on Frankfurt’s original characterization of guidance, the situation, as I have argued in the original comment, is no better. I conclude that shifts of attention in important cases of visual search (where executive control is nevertheless in play) are not guided.

    A final word on Sebastian’s reductio. As I’ve tried to emphasize above, the crucial thing is not that there is an interplay between bottom-up factors and executive control, but rather how the interaction works and what role the contribution of executive control plays in the actual mechanism. According to my interpretation, in typical cases of visual search the contribution of goal representations in directing shifts of attention is not enough for such shifts to be labeled guided behavior.

    1. Hi,

      First off, Denis: I find your presentation (and your research more generally) extremely interesting. I have a huge number of questions for you, and obviously I have a lot to learn from reading your work! I’m going to try reading your dissertation this week, as it’s relevant to what I’m working on now.

      I just wanted to ask a question about the choice of visual search as an example of individual-guided behaviour. I’ve been reading Peter’s comments and I’m sympathetic to some of the points he’s been raising about the automaticity of (most?) attention shifts in realistic conditions, and about the way some researchers think about the role of the salience map or priority map in determining how they occur. (The terminology ‘salience map’ and ‘priority map’, by the way, isn’t well-regimented as far as I know. I think some researchers seem to use these terms interchangeably, while others seem to use them as theoretical competitors, so that the priority map is meant to replace the older notion of a salience map by giving a more central role to broadly ‘top-down’ factors in determining the state of the map. I’m curious if anyone disagrees and would like to explain this for me, since I’m keen to know how these terms get used in the empirical literature).

      So, here’s my question: if Denis is looking for an intuitively compelling example of individual-guided attention, then I wonder if visual search tasks are really the best candidate, and I wonder if the choice of example could be giving Peter the upper hand in this conversation. Just intuitively, the problem with visual search (it seems to me) is that the ‘guide’ doesn’t know where the target is. Background knowledge and the representation of the goal can influence the efficiency of the search in various ways, but it seems to me hardly analogous to the mountain guide, who (let’s suppose) has a lot of knowledge about where the goal is in relation to him, the different routes that are available to him, etc. Just intuitively, there will be a lot more trial and error or stabbing in the dark involved in a Where’s Waldo-type search than in many other sorts of task. And there are other kinds of tasks that we could consider: e.g., ones investigated in ‘natural vision’ studies in which subjects perform well-practiced tasks in familiar, well-understood environments (e.g., making a sandwich, playing a ball game, etc.) or, to use an example Wayne Wu likes to discuss, looking at a painting with the aim of answering questions about it (Yarbus). The results of these eye-tracking studies are always quite striking: the saccades (and hence attention shifts) are manifestly goal-directed. (And some researchers have recently been arguing that those results aren’t well-explained by ‘salience-based’ accounts of attention). So, I’m wondering: if we focus on a different kind of visual task — one in which the central executive is intuitively in a better position to act as a guide, given background knowledge about the scene and where the relevant sources of information are likely to be found — will this help Denis in this debate with Peter? Unless I’m underestimating the power of the central executive (which I might be!), it just seems unfair to expect a ‘guide’ to direct attention toward the target in such a situation, given that the guide doesn’t know where it is. This doesn’t preclude the use of other intelligent strategies for getting there more quickly than one would otherwise, but we can’t expect the guide to do what’s being asked of it. On the other hand, I would think there are lots of other tasks in which the central executive is well-positioned to shift attention toward task-relevant information and not just to the most salient location. And maybe in those other situations the attention shifts do satisfy Peter’s demands on what it would take for attention to be guided.

      1. Hi Aaron, thanks for your interest! Let’s def be in touch, if you want. I’d like to hear about your work, too.

        First of all, yes, I use ‘priority map’ deliberately, because I think it better captures the idea that many different factors – other than the traditional physical saliency – can influence what locations have priority for shifting attention to them.

        Now to your question. I still think that the case of visual search is a good case. It seems (a) clear that we guide visual search, even though (b) we cannot spell out guidance in terms of ‘knowing the destination.’ Finally (c) (visual) search is of absolutely central importance to so many of our activities.
        We are clearly not randomly moving attention around when we engage in visual search. Behavioral data and computational models (have a look at the Geisler references, if interested) show that our attentional system searches in highly efficient and successful ways. True, sometimes it may feel as though we’re just randomly looking around in desperate search for our keys, but actually many factors contribute to making our search more efficient.
        People often tend to think of guidance toward a goal in terms of knowing the destination. That is a natural way of thinking, but it is also too simplistic, I believe. One the one hand, of course, there is an element of that also in visual search. I do know what my keys look like, I look for similar things, and I do so until I’ve found them (I also may apply strategies for finding them, but bracket that). On the other hand, some of the most important activities that we set out to accomplish – foraging, remembering, problem-solving – are such that we cannot have full knowledge of their goals. Otherwise they would be pointless. But they do neither feel random, nor are they, as a matter of scientific fact, random.
        Our embarking on any of these activities activates and shapes subsequent search processes in ways that, by and large, allow us to be fairly successful at reaching our goals. And it is easy to appreciate how important these activities are for understanding what we are up to on a day to day basis.

        1. I see. Thank you, Peter and Denis. I jumped in without properly understanding the dialectic. I think I now see that my point was actually supporting Peter (in fact, possibly repeating him!), and that my observation doesn’t help Denis in this context. I’m not sure where my sympathies lie just yet!

          Stepping back, I’m curious to get clearer on the disagreement between you two. Peter’s claim (correct me if I’m wrong) is that executive control doesn’t correlate with the individual’s guidance, because, while executive control is indeed involved in controlling attention during visual search (by influencing the state of the salience/priority map), it doesn’t guide attention to its goal. Denis’s claim is that they do correlate, and that central executive does guide attention in visual search. Assuming that what Peter means by ‘salience map’ is what Denis means by ‘priority map’ and that you both agree about how executive control informs the state of that map, the disagreement between you two seems to be about how to understand guidance. In particular, I have a feeling this passage from Peter is revealing about how Peter thinks about guidance, and where I suspect Denis can jump off:

          “Consider the set-size effect, i.e. the phenomenon that in visual search response time increases as the number of distractors (non-targets) increases. With a lot of distractors present attention lands on non-targets several times before it finally lands on the target. Is this series of non-target hits such that the non-targets consecutively attended can be said to be closer and closer to the target according to any metrics based on any feature space? I am not aware of any results supporting this claim. Or think about playing ‘Where is Waldo?’. The series of misses before spotting Waldo doesn’t feel like gradual progress, or being driven toward finding him. It feels more like trial and error. Shifts of attention in such (typical) cases of visual search do not seem to be driven toward the target.”

          There are a few ideas here, but one of them is the assumption that guidance control entails “gradual progress” toward a goal. Given the implicit contrast with trial and error, I take it that gradual progress doesn’t simply consist in the elimination of possibilities (e.g., “Is this the target? No. Is this it? No. This? Nope …”), since the latter looks like trial and error (combined with some memory of which locations one has checked). Given that gradual progress doesn’t simply consist in the elimination of possibilities, it would be worth clarifying what this gradual progress requirement consists in and whether it’s essential for guidance. I have a feeling, from what Denis said in reply to me, that he would reject the requirement (cf. Denis’ denial that guidance requires knowing one’s destination. He wants guidance to accommodate more open-ended instances of problem solving).

    2. My idea was that visual search is a clear case in which individuals guide their attention shifts toward a goal – the goal of their search. Suppose that I am searching for a banana. How is my guidance implemented? Executive control implements my setting of the search goal (the banana) and thus controls computations of priority on the priority map (based on similarity of stimuli at locations in the scene to the desired banana). Attention will shift to locations based on priority values for different locations on the map. Similarity to the banana will shape the computations for where to shift next until the goal of the search is reached – until I’ve found the banana. There is no guarantee that attention will shift to the right location (the one with the banana) immediately, or without fail. Indeed, attention often gets it wrong. How wrong attention gets it (and how long it takes to find the banana) depends on the size of the sample, the number of distractors, noise in display and system, and many other factors. But computational models of visual search tell us that the process is very far from being trial and error (cf. the references in my last reply). It is in virtue of the way in which executive control shapes how priority is computed, that we can say that attention is driven toward the search goal.

      You say that you disagree with my proposal. I am not entirely sure I’ve understood why you disagree. Do you think that, intuitively, attention shifts during search are not guided? Do you think that any automatic process could not be guided? Do you think that any process that misses its goal (maybe by a lot) could not be guided? Do you think that any process involving exogenous factors could not be guided? None of these commitments strikes me as promising. Or were exactly do we disagree?

      1. Hi Denis,

        We agree on how executive control operates in visual search, and how it is able to modulate the computations producing salience or priority values. I think we disagree whether this is enough to implement guidance of shifts of attention. I don’t think that it is. What such a modulation can achieve is that it ‘pushes’ the target representation higher up on the priority list. Let’s say that the top is position ‘1’, and executive control modulation can push target representation to position ‘n’ (whereas without such a modulation is would be lower, say in position ‘n+k’). Now even if it is true, I think that shifts of attention during search as it is reallocated to locations corresponding to positions from 1 to n, and finally settles on the target is guided neither in an intuitive, nor in a Frankfurt-based sense. Not because they are random — they are not, that’s exactly what the priority map ensures — but because, to use your criterion, they do not get ‘closer’ to the target (not just spatially, but according to any metric) as they shift from position n-i to n-i+1. Or put it in Frankfurt’s terms: as attention shifts in correspondence with moving through the priority list from position 1 to n “its course is not subject to adjustments which compensate for the effects of forces which would otherwise interfere with the course of the behavior” (Frankfurt, 1978, p.160).

        1. Hey Peter,
          I did not mention the criterion (getting closer to the target by some metric) that you attribute to me. It is not clear to me that it is a condition on guidance. It so happens that it can be easily met in the case of visual search, though. I’ll get back to that below.

          I’d be mostly curious to hear your answer to the questions I had for you in my last post: on what basis do YOU deny that search is guided? Or would you say that the above mentioned criterion does capture your view?

          Let me repeat some points from earlier:
          1. The attentional system is set so as to be ‘looking for’ things similar to the banana (task-driven attentional routines like the ones that Wayne and Aaron mention often are, or involve visual searches; so if you’re not acknowledging guidance in visual search, you should not do so in many of those cases either).
          2. The search performed on this basis is highly efficient and increases the probability of finding the target vis-a-vis trial-and-error searches, random searches, or merely saliency-based computations. Indeed, the probability of finding the target with the NEXT shift increases with each shift. We learn this from the most advanced models of visual search that I quote in my earlier replies. Here’s an obvious sense in which attention gets ‘closer’ to the target, if you like that criterion. (The relevant computations also suppress the influence of e.g. physically salient distractors on where attention shifts. So here’s also a sense in which “its course is … subject to adjustments which compensate for the effects of forces which would otherwise interfere with the course of the behavior,” as Frankfurt suggests it should be.)
          3. There is no reason to think, certainly none that you have provided, that under these circumstances we would not speak of guidance - or do you seriously want to deny that someone could not guide the search-expedition for Pablo Escobar, unless they already knew where Pablo Escobar is hiding? (Anybody’s guess what show I’ve been watching last night.)

          So again, how are these considerations mistaken? What precisely are the alternative considerations in favor that visual search is never guided?

  8. Hi Aaron,
    Thanks, great comment! It illuminates really nicely my point that the source of the problem is what role executive control can play in the actual mechanism of shifting attention.
    I certainly think that my worries apply only to the specific case of visual search, so I am open, and, in fact, sympathetic to the idea that there are other visual tasks (like Wayne’s example) where shifts of attention are indeed guided, and executive control (restricted to what it does in those cases) is in a better position to correlate with guidance.

  9. Hi Denis,

    I’m sure you’re busy answering all of these other comments, but I just had a quick question of clarification about your view. And my apologies if you answer the question in your paper.

    What’s the relationship, on your view, between an agent’s intention to act in some way and their central executive system? A natural view, it seems to me, would be to link the two, so that the central executive helps implement or realize an agent’s current intention. Your objections to traditional action theories seem to suggest that you would not be sympathetic to such a proposal, since you think that intentions are too intellectually sophisticated to explain the agency of non-human animals. What do you say to someone who just equates intention with the executive functions (or some subset of those functions) carried out by the executive control system? That would, it seems, bring your empirically-based view into line with a fairly traditional causal theory of action, on which an agent’s guidance of action consists in the guidance of behaviour by an intention. It would be a consequence of the proposal that many non-human animals have intentions (given that they have a central executive system), but that seems sensible enough to me. Or do you think there are good reasons to deny that non-human animals possess intentions?

  10. I’m also curious what you would say to someone who objects like this: central executive control guides both active and passive attention shifts. The difference between active and passive attention concerns which executive functions are present and which are absent. In active shifts, attention is guided by a goal representation which is maintained in working memory and kept on course by the inhibition of goal-irrelevant information (e.g., distractors). So, active attention exists where there is the presence of maintenance and inhibition and an absence of task-switching. Passive shifts, by contrast, are characterized by the opposite pattern of executive function: an absence of maintenance and inhibition and the presence of task-switching. That is, when your attention is captured, you switch tasks. Assuming that passive attention isn’t agent-guided, then the guidance of attention by executive control doesn’t correlate with agent-guidance.

    Perhaps one could substantiate the details of the proposal by drawing on Corbetta & Shulman’s (2002) distinction between the dorsal attention network and the ventral attention network. The idea would be that the ventral network specializes in task-switches in response to behaviourally significant stimuli, whereas the dorsal network specializes in keeping the agent on task. (Though maybe this way of filling out the empirical details is false).

    1. Hey Aaron,

      On your first question, I would say that (a) I am not sure what it is to equate an intention with an executive function and (b) that, to the extent I get the proposal, it seems in conflict with the empirical facts. Executive control can involve propositional intentions or more image-like representations (and possibly neither). That is one of the strengths of my proposal, as I see it, for it allows us to encompass more animal agency with our account.

      On your second question, I would say that the proposal gets the facts wrong. I like your description of what’s going on in active shifts. But there’s no task switching that controls passive attention shifts. I do not think that appeal to the Corbetta 2002 distinction between different networks bears on this issue (remember the research that I base my characterization of the executive system on).

      I say more on both topics in the paper.

      1. Thank you, Denis. As I mentioned, I’ll be taking a closer look at your work soon.

        Just to elaborate on the first idea (about the potential connection between executive control and the subject’s intention), my thought was roughly this. At its core, you might think an intention is a type of representational state that plays a certain set of roles in the subject’s mental economy and in behavioural control. Which roles? One answer might be: the roles that you’ve identified with the central executive. If a goal representation plays the executive roles (e.g., if it’s maintained in working memory, if it inhibits goal-irrelevant information from consuming central resources, if it integrates and coordinates one’s other states and competencies in a way that tends, over time, toward its own fulfillment), it satisfies the functional description of an intention. Intention just is an executive goal representation.

        That’s the line of thought I was exploring. But if you’re right that states other than intention can play the same executive functions, then obviously the hypothesis I’m exploring is a non-starter. I just wonder what would disqualify those states from being intentions.

Comments are closed.