Local and Global Experts

Contemporary epistemology is replete with principles of expert deference.  Epistemologists have claimed that you should treat the chances, your future selves, your rational self, and your epistemic peers as experts.  What this means is that you should try to align your credences with theirs.

There are lots of ways you might try to align your credences with those of some expert function.  (That expert function could be the chances, or it could be your future credences, or something else altogether.  The particular function won’t matter, so I’ll just call the expert function, whatever it is, “\mathcal{E}”.)  My focus here will be on just two ways of aligning your credences with \mathcal{E}‘s: 1) by treating it as a local expert; and 2) by treating it as a global expert.

Local Expert. You treat \mathcal{E} as a local expert iff, for all propositions a, and all numbers n \in [0, 1]

C(a \mid \langle \mathcal{E}(a) = n \rangle) = n, \,\,\,\, if defined

Global Expert. You treat \mathcal{E} as a global expert iff, for all propositions a, and all potential credence functions E,

C(a \mid \langle \mathcal{E} = E \rangle) = E(a), \,\,\,\, if defined

In these definitions, C is your own credence function.  You should read “\mathcal{E}” as a definite description, along the lines of “the credence function of the expert”.  This definite description may refer to different credence functions at different worlds.  And I am using the brackets “\langle \,\, \rangle” to denote propositions.  Thus, “\langle \mathcal{E}(a) = n \rangle” is the propositions that the expert’s credence that a is n.  It is true at those worlds where \mathcal{E}‘s credence in the proposition a is n.  And \langle \mathcal{E} = E \rangle is the proposition that E is the expert’s entire credence function, true at those worlds w such that \mathcal{E}_w = E (“\mathcal{E}_w” is \mathcal{E}‘s credence function at world w).

It’s not immediately obvious what the relationship is between these two different ways of treating a function as an expert.  You might think that they are equivalent, in the sense that you will treat \mathcal{E} as a local expert if and only if you treat them as a global expert.  In fact, they are not equivalent.  Treating \mathcal{E} as a global expert entails treating \mathcal{E} as a local expert, but the converse is not true.  (Throughout, by the way, I’m assuming probabilism and I’m assuming that your credences are defined over a finite number of worlds).

Proposition 1. If you treat \mathcal{E} as a global expert, then you treat them as a local expert as well.  However, you may treat \mathcal{E} as a local expert without treating them as a global expert.

Proof.  Note that \{ \langle \mathcal{E} = E \rangle \mid E(a) = n \} is a partition of \langle \mathcal{E}(a) = n \rangle.  If you treat \mathcal{E} as a global expert, then for each E such that E(a) = n, C(a \mid \langle \mathcal{E} = E \rangle) = n.  It then follows from conglomerability (which follows from the probability axioms when the number of worlds is finite) that C(a \mid \langle \mathcal{E}(a) = n \rangle) = n .

To see that you may treat \mathcal{E} as a local expert without treating them as a global expert, suppose that there are three possible worlds, w_1, w_2, and w_3, and that the expert’s credence function at each of those worlds is as shown in figure 1 (the example originates from Gaifman’s 1988 article “A Theory of Higher-Order Probabilities”).   (In figure 1, by the way, the ith row gives \mathcal{E}‘s credence distribution over w_1, w_2 and w_3 at the world w_i.)

matrix
Figure 1: A function which may be treated as a local expert, but not a global expert.

And suppose that your own credence distribution over w_1, w_2, and w_3 is such that C(w_i) = 1/3, for i = 1, 2, 3.   Then, for every proposition a and every n, C(a \mid \langle \mathcal{E}(a) = n \rangle) = n.  For instance, if a = \{ w_1, w_2 \} and n = 1/2, then

\begin{aligned} C(\{ w_1, w_2 \} \mid \langle \mathcal{E}(\{ w_1, w_2 \}) = 1/2 \rangle) &= C(\{ w_1, w_2 \} \mid \{ w \mid \mathcal{E}_w(\{ w_1, w_2 \})=1/2 \} ) \\ &= C(\{ w_1, w_2 \} \mid \{ w_2, w_3 \}) \\ &= 1/2 \end{aligned}

And the same is true for every other choice of a and n, as you may check for yourself.  Nevertheless, it is impossible to treat \mathcal{E} as a global expert, since, so long as C is a probability function,

C(\{ w_1 \} \mid \langle \mathcal{E} = \mathcal{E}_{w_1} \rangle) =C(\{ w_1 \} \mid \{ w_1 \}) = 1

But \mathcal{E}_{w_1}(\{ w_1 \}) = 1/2 \neq 1.

\Box

So a principle of local deference is strictly weaker than a principle of global deference.  Or, a perhaps better way of thinking about things: there are strictly more functions which can be treated as local experts than there are functions which can be treated as global experts.

This is a prima facie exciting observation, since a common objection to principles of global deference is that it is possible to treat \mathcal{E} as a global expert if and only if \mathcal{E} is certain of what their own credences are (because the focus is usually on certain ideal credence functions, certainty about your own credences is generally called immodesty).  That is to say, it is possible to treat \mathcal{E} as a global expert if and only if they are immodest—if and only if, for every world w, \mathcal{E}_w(\langle \mathcal{E} = \mathcal{E}_w \rangle) = 1.  For suppose that \mathcal{E} were modest—that is, suppose that, for some world w, \mathcal{E}_w(\langle \mathcal{E} = \mathcal{E}_w \rangle) \neq 1.  And suppose that you treat \mathcal{E} as a global expert.  Then, substituting \langle \mathcal{E} = \mathcal{E}_w \rangle in for a and \mathcal{E}_w in for E in the definition of Global Expert, we have

\begin{aligned} C(\langle \mathcal{E} = \mathcal{E}_w \rangle \mid \langle \mathcal{E} = \mathcal{E}_w \rangle ) = \mathcal{E}_w(\langle \mathcal{E} = \mathcal{E}_w \rangle ) \neq 1 \end{aligned}

But the probability axioms require C(a \mid a) to be 1 (or undefined) for all a.

So: if you think functions which aren’t certain of their own values should nevertheless be treated as experts, then you will think that we need a characterization of “treating a function as an expert” which goes beyond Global Expert.  A common suggestion is to treat \mathcal{E} as a modest expert.

Modest Expert.  You treat \mathcal{E} as a modest expert if and only if, for all propositions a and all potential credence functions E,

C(a \mid \langle \mathcal{E} = E \rangle) = E(a \mid \langle \mathcal{E} = E \rangle)

But perhaps the move to such principles is too hasty.  Perhaps we can get by just with principles of local deference.  For note that the expert shown in figure 1 is modest; yet they can be treated as a local expert.  So there are at least some modest functions which can be treated as local experts.  And perhaps these are all the modest experts we need.

For this reason, the relationship between local and global experts is dialectically important to some debates in epistemology.  For instance, Christensen endorses the claim that you should treat your currently rational self as a local expert.  Elga criticizes this position on the grounds that it requires certainty that you are rational—however, in order to argue for this conclusion, he must first re-present Christensen’s principle as the claim that you should treat your rational self as a global expert (note: Elga recognizes that the second principle is stronger than the first).  Perhaps, in the face of these criticisms, Christensen should hold tight to his original principle; perhaps it affords all the modesty we need.

No such luck, I’m afraid.  Although there are some functions which can be treated as local experts but not global experts, these functions are incredibly singular.  In fact, there is a good sense in which the function shown in figure 1 is the only kind of function which can be treated at a local, but not global, expert.

Given a function \mathcal{E}, from worlds to probability distributions over those same worlds, we can generate a Kripke frame < \mathcal{W}, R > from \mathcal{E} as follows:  \mathcal{E}_w(\{ x \}) \neq 0 if and only if w bears the relation R to x (or, as I shall say, if and only if w sees x).

Let’s say that a Kripke frame < \mathcal{W}, R > is cyclic iff

  1. Every world w \in \mathcal{W} sees itself and exactly one other world.
  2. Every world w \in \mathcal{W} is seen by exactly one distinct world.
  3. There are no two worlds w, x such that w sees x and x sees w.

A sample cyclic frame is shown in figure 2.

cyclic2
Figure 2: A cyclic frame (reflexive arrows have been suppressed)

Note that the function from figure 1 will generate a cyclic frame in which, for each w \in \mathcal{W}, \mathcal{E}_w(w) = 1/2.  Let’s call any function like this a uniform cyclic function (‘uniform’ because at every world \mathcal{E} gives equal probability to its actual world and the one other possible world it sees).

Uniform Cyclicity. A function \mathcal{E} is uniform cyclic if and only if \mathcal{E} generates a cyclic frame and, for every w \in \mathcal{W}, \mathcal{E}_w(\{ w\}) = 1/2.

Now, it turns out that the functions which may be treated as local experts, but which may not be treated as global experts, are precisely the uniform cyclic ones.  If a function is uniform cyclic, then you may treat it as a local expert, but not as a global expert.  And if a function \mathcal{E} is not uniform cyclic, then you can treat \mathcal{E} as a local expert if and only if you can treat it as a global expert.

Proposition 2.  It is possible to treat a function \mathcal{E} as a local expert but not possible to treat them as a global expert when and only when \mathcal{E} is uniform cyclic.

The only credences which treat such a function as a local expert are those which are uniform over the worlds in each cycle.

The proof of this proposition is quite long and tedious, so I’m putting it in a separate document here.

What Proposition 2 means, I think, is that we don’t have to fret about the difference between the local and global formulations of various principles of expert deference.  For what the proposition tells us is that nobody should endorse a principle of local deference without thereby endorsing a principle of global deference.  To endorse a principle of local deference without endorsing a principle of global deference is to say that uniformly cyclic functions are deserving of epistemic deference, but no other immodest function is.  This strikes me as entirely unmotivated.

If we think that you should treat the probability function which generates the cyclic frame in figure 2 as an expert, then we should also think that you should treat the probability function which generates the frame shown in figure 3 as an expert.

cyclic3
Figure 3: No probability function generating this frame may be treated as a local expert.

After all, the only difference between the frame in figure 2 and the frame in figure 3 is that we have taken the single possibility w_1 in figure 2 and divided it into two sub-possibilities w_1 and w_1' in figure 3.  We could suppose that, at all worlds in figure 3, \mathcal{E} gives the proposition \{ w_1, w_1' \} precisely the same probability it gave the singleton proposition \{ w_1 \} in figure 2.  If that’s so, then say that \mathcal{E} reduces to uniform cyclicity.  After all, if we just collapse the possibilities w_1 and w_1', then we get back a uniform cyclic function.  The difference between a uniform cyclic function and a function which merely reduces to uniform cyclicity ought not make any difference with respect to whether some supposed expert is deserving of epistemic deference, nor how that deference ought to be shown.  However, Proposition 2 assures us that such minor changes in representation do make a difference with respect to whether we can treat the function as a local expert.  So, if we’re in for treating some function as a local expert, we shouldn’t demure from treating them as a global expert as well.

So I think that Christensen, e.g., effectively has committed himself to the view that your rational self must be immodest.  While his claim that you should treat your rational self as a local expert does not on its own entail this conclusion, it follows with the rather weak assumption that, if a uniform cyclic function is deserving of epistemic deference, then so too is a function which merely reduces to uniform cyclicity.  Unless Christensen believes that 1) our rational selves could be uniform cyclic, but 2) they could not merely reduce to uniform cyclicity, he should also think that you should treat your rational self as a global expert.  And this entails that your rational self is immodest.

TL;DR: you might have thought that principles of local deference are equivalent to principles of global deference.  They’re not.  Principles of local deference are weaker than principles of global deference.  But they’re really not much weaker—just slightly.  And there’s really no good reason to treat any function as a local expert but not a global expert.  So, while they’re ever-so-slightly different, really, you shouldn’t ever worry about the differences.

2 thoughts on “Local and Global Experts

  1. Nice post, Dmitri. I’m not quite sure about the move to: “So, if we’re in for treating some function as a local expert, we shouldn’t demure from treating them as a global expert as well” given the results here. After all, formally, all we know is that (except for certain pathological cases) any credence function that can be treated as a local expert can also be treated as a global expert. Why should we think that that means that any good local expert *should* be treated as a global expert?

    Here’s a possible case. I know very little about, say, chemistry. I have a friend Bill who I know is a mediocre chemist. Bill knows a lot more about chemistry than I do though. So, if I ask Bill “what’s your credence that X will happen if we mix chemicals Y and Z together?,” I should treat Bill as an expert. However, suppose I learn all of Bill’s credences about chemistry. If I do that, I will learn quite a bit about chemistry myself—e.g., by learning all his credences about O-chem, I can now more or less understand O-chem. So, now I feel that I (assuming I’m smart enough) can make my own judgments about chemistry, and perhaps I can do so better than Bill.

    The general point is that if you don’t know much about a subject matter, somebody can be a local expert because you judge that you can do no better than just listen to her. The underlying theory or workings behind the phenomena are black-box to you, so deferring is your best strategy. If you learn all that person’s credences about the subject matter, the subject matter is no longer black box to you. You can start to figure it out on your own and blind deference may not be your best strategy. So, the person isn’t a global expert.

    Liked by 1 person

    1. Thanks for the reply, Ben!

      “formally, all we know is that (except for certain pathological cases) any credence function that can be treated as a local expert can also be treated as a global expert. Why should we think that that means that any good local expert *should* be treated as a global expert?”

      You’re right that there’s a logical leap here. What I need is the additional proposition, which I assumed but didn’t prove:

      Proposition 3: If an expert {\mathcal{E}} is immodest, then, if you treat {\mathcal{E}} as a local expert, then you treat them as a global expert as well.

      (For, if it is possible to treat { \mathcal{E}} as a local expert but we’re not in the pathological case, then it follows from Proposition 2 that it must also be possible to treat { \mathcal{E}} as a global expert. And it’s possible to treat { \mathcal{E}} as a global expert iff { \mathcal{E}} is immodest. So, it would follow from Propositions 1, 2, and 3 that, if we’re not in the pathological cases, then you’ll treat { \mathcal{E}} as a local expert iff you treat them as a global expert—which is what I took myself to have shown.)

      This proposition is true, though you’re quite right that I should have mentioned it. Here’s a proof: If { \mathcal{E}} is immodest, then it generates an S5 Kripke frame where the expert’s credence function is the same for all {R}-related worlds. That is, the relation { R} partitions { \mathcal{W}} into equivalence classes, where each equivalence class { [w]_R = \langle \mathcal{E} = \mathcal{E}_w \rangle}. Then, for each pair of worlds { w, w' \in \mathcal{W}}, either { \mathcal{E}_w(\{ w' \}) = \mathcal{E}_{w'}(\{ w' \})} (if { w' \in [w]_R}) or { \mathcal{E}_w(\{ w' \}) = 0} (if { w' \notin [w]_R}). So, for each { w \in \mathcal{W}} and each {w' \in [w]_R = \langle \mathcal{E} = \mathcal{E}_w \rangle},

      \displaystyle  \langle \mathcal{E}(\{w'\}) = \mathcal{E}_w(\{w'\}) \rangle = \langle \mathcal{E} = \mathcal{E}_w \rangle

      And thus, if you treat { \mathcal{E}} as a local expert, then, for each { w \in \mathcal{W}} and each {w' \in [w]_R = \langle \mathcal{E} = \mathcal{E}_w \rangle},

      \displaystyle  C(\{ w' \} \mid \langle \mathcal{E}(\{w'\}) = \mathcal{E}_w(\{w'\}) \rangle) \,\,\,=\,\,\, C(\{ w'\} \mid \langle \mathcal{E} = \mathcal{E}_w \rangle) \,\,\,=\,\,\, \mathcal{E}_w(\{ w' \})

      And, for each {w \in \mathcal{W}} and each {w' \notin [w]_R = \langle \mathcal{E} = \mathcal{E}_w \rangle}:

      \displaystyle  			C(\{ w'\} \mid \langle \mathcal{E} = \mathcal{E}_w \rangle) \,\,\,=\,\,\, 0

      So, for every proposition {a} and every {w \in \mathcal{W}},

      \displaystyle  		\begin{aligned} 		C(a \mid \langle \mathcal{E} = \mathcal{E}_w \rangle) &= \sum_{w' \in a \cap \langle \mathcal{E} = \mathcal{E}_w \rangle } C(\{ w' \} \mid \langle \mathcal{E} = \mathcal{E}_w \rangle)	\,\,\,+\,\,\, 	\sum_{w' \in a \cap \neg \langle \mathcal{E} = \mathcal{E}_w \rangle } C(\{ w' \} \mid \langle \mathcal{E} = \mathcal{E}_w \rangle)\\ 		&= \sum_{w' \in a \cap \langle \mathcal{E} = \mathcal{E}_w \rangle} \mathcal{E}_{w}(\{ w' \})	 \,\,\,+\,\,\, 0		\\ 		&= \mathcal{E}_w(a) 		\end{aligned}

      So you treat {\mathcal{E}} as a global expert. QED.

      I think what’s going on in your case isn’t that you want to treat Bill as a local expert but not a global expert. Rather, I think that you only want to treat Bill as an expert about some propositions, but not others. You don’t want to treat Bill as an expert about the proposition which is the conjunction of all facts about O-Chem together with the proposition that X will happen if we mix Y and Z. Call the conjunction of facts about O-Chem ‘{o}‘ and call the proposition that you’d get reaction X when you mix Y and Z ‘{x}‘. Then, conditional on Bill’s credence in {o \wedge x} being 1, perhaps you’d rather not just defer to Bill’s opinion, but rather use the information contained in {o} to make up your own mind about whether {x}.

      I think that’s probably the right attitude to have towards experts—you should defer to them about some matters, but not all matters. (I’ve got a paper arguing that this is the right way to treat chance and your future rational self.) But to take that attitude is to already give up on treating them as even a local expert.

      Like

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out /  Change )

Google photo

You are commenting using your Google account. Log Out /  Change )

Twitter picture

You are commenting using your Twitter account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

Connecting to %s