Understanding Universal Property and Universal Element (from Category Theory in Context, Riehl)

category-theoryuniversal-property

I'm trying to understand universal Property and universal element as stated in Category Theory in Context.

The book states

enter image description here

So from what I understand, since $C(c, -)$ is representable, there exists natural isomorphism between $F$ and $C(c,-)$, call this natural isomorphism $\alpha$ And since by Yoneda, this $\alpha$ corresponds to some $x \in Fc$, which we call the universal element.

Is my understanding correct?

Also, am I correct in saying that universal elements are not unique, since there may be multiple natural isomorphism between $F$ and $C(c,-)$?

Also,

enter image description here

enter image description here

Also, here, then $\otimes$ is not unique, so I would think different $\otimes$ corresponds to different natural isomorphism, and hence different $\overline{f}$, is that right?

Thanks for your help!

Best Answer

I'm adding an answer to this question because I both agree and disagree with trujello's answer in the comments on the question.

I agree that the definition Riehl gives is somewhat opaque for a beginner, but I definitely disagree that the definition is bad. This is the definition that I use intuitively now, though it certainly wasn't the one I first learned (partly because I never saw a clear definition when I was learning, only a lot of examples).

I also think that if you start off with learning this definition you'll avoid a couple common misconceptions that I encountered in the past. For example, this definition makes the role of the universal element clear. A product is not just an object, but an object with a specific family of projections satisfying a property.

Some philosophy on definitions

What makes a definition good? One possible viewpoint is that a good mathematical definition is one that abstracts a bunch of common situations in some area of mathematics and identifies the key feature that allows you to understand them as being all examples of the same underlying concept.

A result of this viewpoint is that the definition can seem opaque when you aren't so familiar with the situations being abstracted to see how they all fit into this same concept. (For example, I've recently been introduced to the concept of triangulated categories, whose examples are the stable homotopy category, derived categories of complexes, and modular representation theory, none of which are particularly familiar to me, which makes understanding the definition a bit more difficult.)

How then can we understand such definitions?

The answer I'll give is in two parts, and I'll try to provide these in this answer. First, understand the examples motivating the definition. A good algebra textbook doesn't just drop the definition of a group on students with no examples, and this definition of universal property is the same. It needs to be packaged with a long list of examples. Second, the connection between the motivating examples and the definition needs to be made explicit.

With that said, let's get to answering the real question asked above, how to understand Riehl's definition, and what exactly is going on with universal elements/their uniqueness.

Motivating examples of universal properties

A note that all of these examples all are described as having universal properties whether presented in Riehl's form or not. Riehl's definition uses the name universal property because that is the name already in use for all the examples. I will therefore sort of have to use it in both the more traditional sense and in Riehl's sense, although they do turn out to be equivalent.

Also, I have a slight quibble of the precise wording of her definition, which may or may not be contributing to your confusion about the uniqueness of the universal element. I would say that the representing object $c$ is a universal object, the element $x\in F(c)$ is the universal element, and the pair $(c,x)$ has the universal property that it represents the functor $F$.

Products (more generally limits)

Let $C$ be a category, $x,y\in C$ objects. A product of $x$ and $y$ is an object $x\times y$, and a pair of morphisms $\pi_1 : x\times y \to x$, $\pi_2:x\times y \to y$ such that the following universal property (not obviously in Riehl's sense) is satisfied:

For all objects $z$, with a pair of morphisms $p_1 : z\to x$ and $p_2:z\to y$, there is a unique morphism $(p_1,p_2): z\to x\times y$ such that $\pi_1(p_1,p_2) = p_1$, and $\pi_2(p_1,p_2)=p_2$.

Relating to Riehl's definition: We need a functor and a universal element. The functor here is $C(-,x)\times C(-,y)$, where $C(x,y)$ is notation for the morphisms from $x$ to $y$ in the category $C$. In other words, this functor sends an object $z$ to pairs of morphisms $(p_1:z\to x, p_2:z\to y)$.

A representing object for $C(-,x)\times C(-,y)$ consists of a pair of an object $x\times y$ and a universal element $$(\pi_1,\pi_2)\in C(x\times y,x)\times C(x\times y,y),$$ such that $(\pi_1,\pi_2)$ defines a natural isomorphism $$C(-,x\times y)\simeq C(-,x)\times C(_,y).$$ This property of the universal element translates into the fact that morphisms $f:z\to x\times y$ correspond bijectively to pairs of maps $(f_1:z\to x,f_2:z\to y)$ via $f\mapsto (\pi_1\circ f,\pi_2\circ f)$, which is the first version of the universal property given in the definition of the product.

Free abelian groups (more generally free objects)

Let $S$ be a set. Let $U$ be the forgetful functor from abelian groups to sets. Let $\newcommand\Z{\mathbb{Z}} \Z\{S\}$ denote the free abelian group on the set $S$, with $[s]$ denoting the basis element corresponding to a given element $s\in S$. The universal property (again, not obviously Riehl's definition, though equivalent) of $\Z\{S\}$ is that for any abelian group $A$, and any map $\phi:S\to UA$, there is a unique morphism $\tilde{\phi}:\Z\{S\}\to A$ such that $\tilde{\phi}([s])=\phi(s)$.

Relating to Riehl's definition The functor here is $\newcommand\Set{\mathbf{Set}} \Set(S,U-)$. The universal object should be $\Z\{S\}$, and the universal element should be a function $\psi : S\to U\Z\{S\}$, which is $\psi(s)=[s]$. The natural transformation corresponding to $\psi$ sends a morphism $\tilde{\phi}:\Z\{S\}\to A$ to the function $s\mapsto \tilde{\phi}([s])$. The fact that this natural transformation is a natural isomorphism yields the version of the universal property above.

Tensor products

Again, we'll give the traditional definition of the tensor product. The tensor product $V\otimes_k W$ of vector spaces $V$ and $W$ over a field $k$ is a vector space with a $k$-bilinear map $\otimes : V\times W\to V\otimes_k W$ such that for any $k$-bilinear map $\phi : V\times W\to U$, there is a unique linear map $\tilde{\phi}:V\otimes_k W\to U$ with $\phi =\tilde{\phi}\circ \otimes$.

Here the functor is $\newcommand\Bilin{\operatorname{Bilin}}\Bilin_k(V,W;-)$, the universal object is $\otimes$, the universal element is $\otimes$, and we see that the universal property is about a bijection between maps $\newcommand\Vect{\mathbf{Vect}}\Vect_k(V\otimes_kW,U)$ and $\Bilin_k(V,W;U)$.

There are of course many more examples of universal properties, but I'll leave this here.

Universal elements and their uniqueness

Firstly, universal objects and universal elements are often not unique, but they are unique up to unique isomorphism in a sense that I'll explain below.

First, an example of nonuniqueness. In the tensor product example we can take any automorphism of $V\otimes_k W$ as a $k$-vector space, $\alpha$, and $\alpha\circ \otimes$ will also be a universal element. The point to take away from this is that the tensor product is not just the object $V\otimes_k W$ (despite the somewhat misleading notation), it is the pair $(V\otimes_k W,\otimes)$. I.e. the pair of a universal object and a universal element. These pairs are not unique, but they are unique up to unique isomorphism, which is just about the best you can get in category theory.

Suppose $(c,x)$ is a representing pair for a functor $F$, and suppose $(c',x')$ is also a representing pair for $F$. Then there is a unique isomorphism $c\to c'$ such that $F(c\to c')$ sends $x$ to $x'$. The proof of this is an application of the Yoneda lemma: We have natural isomorphisms $$C(c,-)\simeq F\simeq C(c',-),$$ where the following elements correspond $$1_c\leftrightarrow x\leftrightarrow \phi:c'\to c$$ $$\psi:c\to c'\leftrightarrow x'\leftrightarrow 1_{c'}.$$ Applying $\psi$ to the first correspondence, and $\phi$ to the second correspondence, we get $$\psi \leftrightarrow F(\psi)(x)\leftrightarrow \psi\phi$$ $$\phi\psi\leftrightarrow F(\phi)(x')\leftrightarrow \phi.$$ Comparing these to the first pair of correspondences, we see that $\psi\phi = 1_{c'}$, $\phi\psi = 1_c$, and $F(\psi)(x) = x'$, which proves existence of the claimed isomorphism. Uniqueness is because if $F(\beta)(x)=x'$, then we would get $\beta \leftrightarrow x' \leftrightarrow \beta\phi$, which forces $\beta = \psi$.