MathAlgebraGroups

Introduction to groups

12 minutes read

There are no precise boundaries between different branches of math, no strict and formal rules defining which concepts and facts belong to discrete math and which to algebra, which to real analysis, and which to the topology. But a somewhat vague description of algebra may say that it's an area of math seeking similar patterns in different situations.

For example, one of the earliest algebraic ideas, introduced by the 16th-century French mathematician François Viète, was representing known and unknown numbers by letters. Now it seems natural to us to write (a+b)2=a2+2ab+b2(a + b)^{2} = a^{2} + 2ab + b^{2} without bothering ourselves about what numbers hide under aa and bb, but four and a half centuries ago, using letters was a breakthrough. (Fun fact: Viète served as a codebreaker to a French king, and when the King of Spain discovered that his secret communications were read, he was shocked and accused the French of using black magic.)

Looking at the further development of algebra, you notice that it always tries to find some general structures, to reveal similarities. These general patterns may seem abstract, but that makes them more universal. Let's talk about the concept of a group -- abstract, but simple, ubiquitous, and, as it turns out, very useful.

Binary operations

Consider the integers. We can add, subtract and multiply them, we can also find the greatest common divisor of any two of them. In all these cases, we take two integers as our input and produce a new integer as an output. That means we have just seen several examples of binary operations (on the set Z\mathbb{Z}).

Definition. A binary operation on a set AA is a mapping (a function) from A×AA \times A to AA.

The result of a binary operation applied to (a,b)(a, b) is usually written in the form aba \star b , where a star is a special symbol for the operation. In different settings, the symbols can be different, such as plus ++, dot \cdot or circle \circ. Moreover, sometimes a symbol of the operation(usually the symbol \cdot ) is omitted, and instead of aba \cdot b we write abab . The arguments aa and bb are called the operands.

Let's consider one more example. Suppose XX is a non-empty set and S(X)S(X) is the set of all bijections from XX to itself. For instance, if X={1,2}X = \{1, 2\}, then S(X)S(X) consists of only two bijective mappings: S(X)={f,g}S(X) = \{f, g\} (look at the pictures):

Two bijection

Of course, for larger XX, the set S(X)S(X) will also be much larger. But no matter how tiny or how huge XX is, for any f,gS(X)f, g \in S(X) we can form the composition gfg \circ f defined as usual: if xXx \in X, then (gf)(x)=g(f(x))(g \circ f)(x) = g(f(x))So the composition is, in fact, a binary operation on S(X)S(X).

The definition of a group

Can we notice some common properties for addition on the set of integers Z\mathbb{Z} and composition on the set of bijections S(X)S(X) for some XX?

First, both operations are associative. It doesn't matter how you place the brackets in the expression 2+3+4,2 + 3 + 4,as both (2+3)+4=5+4(2 + 3) + 4 = 5 + 4 and 2+(3+4)=2+72 + (3 + 4) = 2 + 7 give 99, and the similar fact is true for any three integers. The brackets' placement also doesn't change anything with functions:

h(gf)=(hg)fh \circ (g \circ f) =(h \circ g) \circ fIf you wonder why the last statement is true, try to apply the function from the left-hand side and the function from the right-hand side to the same element xXx \in X and compare the results:

(h(gf))(x)=h((gf)(x))=h(g(f(x))(h \circ (g \circ f))(x) = h ((g \circ f)(x)) = h(g(f(x))\\

Applying the function from the left-hand side

and

((hg)f)(x)=(hg)(f(x))=h(g(f(x))((h \circ g) \circ f)(x) = (h \circ g) (f(x)) = h(g(f(x))\\

Applying the function from the right-hand side

Second, adding zero to any integer doesn't change this integer. An analogous role in S(X)S(X) will be played by an identity function idS(X)id \in S(X) such that

id(x)=x for every xXid(x) = x \text{ for every } x \in XIndeed, for any fS(X)f \in S(X) and any xXx \in X

(idf)(x)=id(f(x))=f(x)(id \circ f) (x) = id(f(x)) = f(x)\\(fid)(x)=f(id(x))=f(x)(f \circ id)(x) = f(id(x)) = f(x)so idf=fid=f.id \circ f = f \circ id = f.Third, for any integer aa, the sum of aa and a-a is zero. Likewise, any bijection ff from S(X)S(X) is invertible -- that means, there exists another function, the composition of which with ff (in any order) gives the identity function.

Let's give a formal definition.

Definition. A set GG with a binary operation \star on it is called a group, if it satisfies the following three properties:

  1. (ab)c=a(bc)(a \star b) \star c = a \star (b \star c) for any a,b,cGa, b, c \in G (this property is known as the associativity of operation \star)
  2. there exists an element eGe \in G (called a neutral element or an identity) such that ae=ea=aa \star e = e \star a = a for any aGa \in G
  3. for each aa there exists an element aGa' \in G such that aa=aa=ea \star a' = a' \star a = e (such aa' is called an inverse of aa)

Note that the neutral element is unique: if both ee andee' are identities, then e=ee=e.e = e \star e' = e.Also, the inverse of a given aa is unique.

Indeed, suppose that aa' and aa'' are both inverses of aa. That means that aa=ea \star a'' = e and aa=ea' \star a = e, so

a=ae=a(aa)=(aa)a=ea=aa' = a' \star e =a' \star (a \star a'') = (a'\star a) \star a'' = e \star a'' = a''

We'll usually denote set GG with a binary operation \star , satisfying the definition of a group, as (G,)(G, \star). In some situations, mathematicians specify only the set and assume that the operation is obvious from the context.

More examples

Of course, the examples of groups are not limited to (Z,+)(\mathbb{Z}, +) and (S(X),)(S(X), \circ). Consider the set R=R{0}\mathbb{R}^{*} = \mathbb{R} \setminus \{0 \} (that is, all real numbers except zero) and multiplication on it. You can easily see that the operation is associative, that 11 is a neutral element and that for every element rRr \in \mathbb{R}^{*} -- in other words, for any real number not equal to zero -- there exist an inverse r1r^{-1} such that rr1=r1r=1r \cdot r^{-1} = r^{-1} \cdot r = 1

If the group is written multiplicatively (that is, using the symbol \cdot for the operation), the inverse of an element aa is always denoted by a1a^{-1}. In the case of the additive notation (using the symbol ++ for the operation), the inverse (sometimes we say explicitly: the additive inverse) is usually written as a-a.

Another important example is the group of residues modulo mm with respect to modular addition. Recall that its elements can be viewed as the integers from 00 to m1m-1, and their modular sum is just the remainder of their ordinary sum divided by mm. We'll denote this group as (Zm,+)(\mathbb{Z}_m, +) , and write its elements with a line over: 0,1,,m1\overline{0}, \overline{1}, \ldots, \overline{m - 1}. For instance, Z5={0,1,2,3,4}\mathbb{Z}_5 = \{\overline{0},\overline{1}, \overline{2}, \overline{3}, \overline{4}\}The addition in Zm\mathbb{Z}_m is associative, 0\overline{0} is the neutral element, and for x\overline{x}, the role of the inverse is played by mx\overline{m - x} .

Abelian groups

Among the groups discussed above, the group (S(X),)(S(X), \circ) of bijections stands out.

In (Z,+)(\mathbb{Z}, +), (Zm,+)(\mathbb{Z}_m, +), or (R,)(\mathbb{R}^{*}, \cdot), you can change the order of two operands and the result will stay the same. On the other hand, fggff \circ g \neq g \circ f in the general case of arbitrary f,gS(X)f, g \in S(X).

Definition. A group (G,)(G, \star) with the additional property that ab=ba for any a,bGa \star b = b \star a \text{ for any } a, b \in G is called commutative, or abelian (after the name of Norwegian mathematician Niels Henrik Abel) group.

The groups (Z,+)(\mathbb{Z}, +), (Zm,+)(\mathbb{Z}_m, +), or (R,)(\mathbb{R}^{*}, \cdot) are abelian. (S(X),)(S(X), \circ) is non-abelian, and there exist other examples of non-abelian groups. That's why in properties 2 and 3 of a group we have written aa=aa=ea \star a' = a' \star a = e and aa=aa=e,a \star a' = a' \star a = e,mentioned the products in both possible orders.

Conclusion

In this topic, we got acquainted with the concept of a group, seeing the common properties of seemingly different objects. Such generalization allows us not to spend our attention on unnecessary details and thus turns out to be very useful not only for theoretical research but also for practical applications. We have learned that:

  • A binary operation takes two elements and produces a third, new element. We denote it like this: aba \star b, where \star represents the symbol of operation, while aa and bb are the so-called operands.
  • For a set with a binary operation to be called a group has to satisfy 3 conditions: associativity, presence of an identity element, and the existence of an inverse. These 3 conditions must hold true for all the elements in the set.
  • Abelian groups are groups that also meet additional criteria in commutativity, i.e. ab=ba for any a,bGa \star b = b \star a \text{ for any } a, b \in G.
4 learners liked this piece of theory. 0 didn't like it. What about you?
Report a typo