Calculus/Conic sections


Conic sections are the curves formed by intersecting the surface of a cone with a plane. The plane can cut the cone in three ways. If the plane cuts the cone vertically, the intersection is a hyperbola. If the plane is parallel to a slant line on the surface of the cone, the intersection is a parabola. If the plane cuts the cone horizontally or at a slight incline, the intersection is an ellipse. For more information, see Conic section.

In future chapters, you will encounter more about conic sections. As you progress into polar coordinates, parametric equations, and three-dimensional quadric surfaces, conic sections will come up again and will remain a challenging topic. In this chapter, we will only talk about the basic conic sections in Cartesian coordinates.

Standard equations

Ellipse

Ellipses are shapes that have an interesting property. In order to find the standard equation for the ellipse, we must know what an ellipse is. Apart from the intersection of an inclined plane and the surface of a cone, there is another way to construct an ellipse.

Definition of an ellipse

An ellipse is a plane curve surrounding two focal points, such that for all points on the curve, the sum of the two distances to the focal points is a constant.

Alternatively, assume points $F_{1}=(c_{x1},c_{y1})$ , $F_{2}=(c_{x2},c_{y2})$ , and $P=(x,y)$ , and let $C$ be a constant. The ellipse is the set of points that satisfy:

$P\in \{(x,y):PF_{1}+PF_{2}=C,C\in \mathbb {R} \}$

The image on the right is the graph of an ellipse. If there is a point $P=(x,y)$ , then $PF_{1}+PF_{2}=C$ , where $C$ is a constant.
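The definition can be tried out numerically. The following is a minimal sketch, not part of the original text: the function names, the tolerance `tol`, and the example foci and constant are all assumptions chosen for illustration.

```python
import math

def distance_sum(p, f1, f2):
    """Return PF1 + PF2 for a point p and foci f1, f2, each given as (x, y)."""
    return math.dist(p, f1) + math.dist(p, f2)

def on_ellipse(p, f1, f2, C, tol=1e-9):
    """True if p satisfies PF1 + PF2 = C to within the tolerance tol."""
    return abs(distance_sum(p, f1, f2) - C) <= tol

# Hypothetical example: foci at (3, 0) and (-3, 0), constant C = 10.
f1, f2, C = (3.0, 0.0), (-3.0, 0.0), 10.0
print(on_ellipse((5.0, 0.0), f1, f2, C))  # distances are 2 and 8
print(on_ellipse((0.0, 4.0), f1, f2, C))  # distances are 5 and 5
print(on_ellipse((1.0, 1.0), f1, f2, C))  # distance sum is below 10
```

The first two points have a distance sum of exactly 10, so they lie on this ellipse, while the third point lies inside it.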

Knowing the defining characteristic of an ellipse, we can start finding the equation.

Ellipse: notations

Derivation

To make things simple, let's set the center of the ellipse on the origin and the points $F_{1}$ and $F_{2}$ on the $x$ -axis. (see image on the right)

Since $PF_{1}+PF_{2}=C$ and $F_{1}=(c,0),F_{2}=(-c,0),P=(x,y)$ , using the distance formula, we can get:

$PF_{1}={\sqrt {(x-c)^{2}+(y-0)^{2}}}={\sqrt {x^{2}-2xc+c^{2}+y^{2}}}$
$PF_{2}={\sqrt {(x+c)^{2}+(y-0)^{2}}}={\sqrt {x^{2}+2xc+c^{2}+y^{2}}}$
$PF_{1}+PF_{2}={\sqrt {x^{2}-2xc+c^{2}+y^{2}}}+{\sqrt {x^{2}+2xc+c^{2}+y^{2}}}$

Now for the constant $C$ . Let the length of the semi-major axis be $a$ and the length of the semi-minor axis be $b$ . Imagine that point $P$ is now at the vertex, so $P_{0}=(a,0)$ . At this particular point,

$P_{0}F_{1}=a-c$
$P_{0}F_{2}=a+c$
$\therefore C=P_{0}F_{1}+P_{0}F_{2}=2a$

Because the definition says that $PF_{1}+PF_{2}=C$ for any $P$ , we can safely say that $C=2a$ . Thus, we can start solving the equation.

${\sqrt {x^{2}-2xc+c^{2}+y^{2}}}+{\sqrt {x^{2}+2xc+c^{2}+y^{2}}}=2a$
$\Leftrightarrow {\sqrt {x^{2}-2xc+c^{2}+y^{2}}}=2a-{\sqrt {x^{2}+2xc+c^{2}+y^{2}}}$
$\Leftrightarrow x^{2}-2xc+c^{2}+y^{2}=4a^{2}-4a{\sqrt {x^{2}+2xc+c^{2}+y^{2}}}+x^{2}+2xc+c^{2}+y^{2}$ (Square both sides)
$\Leftrightarrow 4a{\sqrt {x^{2}+2xc+c^{2}+y^{2}}}=4a^{2}+4xc$ (Simplify it)
$\Leftrightarrow a{\sqrt {x^{2}+2xc+c^{2}+y^{2}}}=a^{2}+xc$
$\Leftrightarrow a^{2}(x^{2}+2xc+c^{2}+y^{2})=a^{4}+2a^{2}xc+x^{2}c^{2}$ (Square both sides)
$\Leftrightarrow a^{2}x^{2}+2a^{2}xc+a^{2}c^{2}+a^{2}y^{2}=a^{4}+2a^{2}xc+x^{2}c^{2}$
$\Leftrightarrow a^{2}x^{2}-c^{2}x^{2}+a^{2}y^{2}=a^{4}-a^{2}c^{2}$ (Simplify it)
$\Leftrightarrow x^{2}(a^{2}-c^{2})+a^{2}y^{2}=a^{2}(a^{2}-c^{2})$ (Factor it)
Finally, the equation is
${\frac {x^{2}}{a^{2}}}+{\frac {y^{2}}{a^{2}-c^{2}}}=1$

This is already a valid equation of the ellipse, but $a^{2}-c^{2}$ can be simplified further. To do so, imagine that our point $P$ is now at the co-vertex, so $P_{1}=(0,b)$ . Thus,

$P_{1}F_{1}={\sqrt {c^{2}+b^{2}}}$
$P_{1}F_{2}={\sqrt {c^{2}+b^{2}}}$
$\therefore C=P_{1}F_{1}+P_{1}F_{2}=2{\sqrt {c^{2}+b^{2}}}$

Since we have already established that $C=2a$ , we can write down an equation that links $a,b,c$ together:

$2{\sqrt {c^{2}+b^{2}}}=2a$
$\Leftrightarrow c^{2}+b^{2}=a^{2}$
$\Leftrightarrow a^{2}-c^{2}=b^{2}$

Substituting $b^{2}$ for $a^{2}-c^{2}$ , we get the final result:

${\frac {x^{2}}{a^{2}}}+{\frac {y^{2}}{b^{2}}}=1$

$\blacksquare$

This equation is the standard form of the ellipse. It is considered standard because all key points are on the axes.
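As a sanity check on the derivation, one can sample points on the standard ellipse and confirm that the distance sum equals $2a$ . This is only a sketch: the parametrization $(a\cos t,b\sin t)$ is an assumption not used in the text above (it satisfies the standard equation because $\cos ^{2}t+\sin ^{2}t=1$ ), and the semi-axes are hypothetical values.

```python
import math

def focal_sum(a, b, t):
    """PF1 + PF2 for the point (a cos t, b sin t) on x^2/a^2 + y^2/b^2 = 1."""
    c = math.sqrt(a * a - b * b)  # from a^2 - c^2 = b^2; requires a >= b
    x, y = a * math.cos(t), b * math.sin(t)
    return math.hypot(x - c, y) + math.hypot(x + c, y)

a, b = 5.0, 3.0  # hypothetical semi-major and semi-minor axes
sums = [focal_sum(a, b, 2 * math.pi * k / 12) for k in range(12)]
print(all(math.isclose(s, 2 * a) for s in sums))
```

Every sampled point should give a sum of $2a$ up to floating-point rounding, which is why the comparison uses `math.isclose` rather than exact equality.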

Terminology and properties

The terminology and properties below are based on the equation we just derived:

${\frac {x^{2}}{a^{2}}}+{\frac {y^{2}}{b^{2}}}=1$

Focus (plural: foci): the points $F_{1},F_{2}$ which have coordinates $(c,0),(-c,0)$ respectively. The definition gives those points their function: $PF_{1}+PF_{2}=2a$ .
Semi-major axis: the axis with length $a$ .
Semi-minor axis: the axis with length $b$ , $b<a$ .
Vertex (plural: vertices): the endpoint of the semi-major axis. It has the coordinate $(\pm a,0)$ .
Co-vertex: the endpoint of the semi-minor axis. It has the coordinate $(0,\pm b)$ , $b<a$ .
Center: the middle point between the two foci. It has the coordinate $(0,0)$ .

Note that any change to the equation will change the coordinates of these key points. The coordinates above are strictly based upon the standard equation of the ellipse.

In the derivation, we stumbled upon a property, which is the relationship between the constants $a,b,c$ . The property is $c^{2}=a^{2}-b^{2}$ . In the derivation of the equation of the hyperbola, we will encounter this property again. However, it will be slightly different because of the signs. In the ellipse, $a>c$ , so the property $c^{2}=a^{2}-b^{2}$ ensures that $a^{2}>0,b^{2}>0,c^{2}>0$ . In the hyperbola, as we will see, $c>a$ , which means the distance from the foci to the center is larger than the distance from the vertices to the center. In order to ensure that $a^{2}>0,b^{2}>0,c^{2}>0$ for convenience, the property will be slightly adjusted.
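The relationship $c^{2}=a^{2}-b^{2}$ makes it possible to compute every key point of a standard ellipse from $a$ and $b$ alone. The sketch below illustrates this; the function name `ellipse_key_points` and the dictionary layout are hypothetical, and the code assumes a horizontal ellipse centered on the origin with $a>b>0$ .

```python
import math

def ellipse_key_points(a, b):
    """Key points of x^2/a^2 + y^2/b^2 = 1, assuming a > b > 0."""
    if not a > b > 0:
        raise ValueError("expected a > b > 0 for a horizontal ellipse")
    c = math.sqrt(a * a - b * b)  # c^2 = a^2 - b^2
    return {
        "center": (0.0, 0.0),
        "foci": [(c, 0.0), (-c, 0.0)],
        "vertices": [(a, 0.0), (-a, 0.0)],         # ends of the major axis
        "co_vertices": [(0.0, b), (0.0, -b)],      # ends of the minor axis
    }

points = ellipse_key_points(5.0, 3.0)  # hypothetical values of a and b
print(points["foci"])
```

With $a=5$ and $b=3$ , the focal distance is $c={\sqrt {25-9}}=4$ , so the foci are at $(\pm 4,0)$ .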

Transformations

If we want the ellipse to be more "vertical" instead of "horizontal", the equation of the ellipse needs to be changed. To be more "vertical", the foci of the ellipse should be on the $y$ -axis, having coordinates $(0,\pm c)$ . Using the same method for derivation, we get:

${\frac {x^{2}}{b^{2}}}+{\frac {y^{2}}{a^{2}}}=1$

If we want the ellipse to translate (move without rotating) in the plane, using what we've learned in Chapter 1.2, we can modify the equation into:

${\frac {(x-m)^{2}}{a^{2}}}+{\frac {(y-n)^{2}}{b^{2}}}=1$ , where $(m,n)$ is the center of the ellipse.
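To see the translated equation at work, the left-hand side can be evaluated at a point and compared with 1. This is a minimal sketch with hypothetical names (`ellipse_lhs`, `locate`) and example values; it relies on the fact that the left-hand side is less than 1 inside the ellipse and greater than 1 outside it.

```python
def ellipse_lhs(x, y, m, n, a, b):
    """Left-hand side of (x-m)^2/a^2 + (y-n)^2/b^2 = 1."""
    return (x - m) ** 2 / a ** 2 + (y - n) ** 2 / b ** 2

def locate(x, y, m, n, a, b, tol=1e-9):
    """Classify (x, y) as 'on', 'inside', or 'outside' the translated ellipse."""
    value = ellipse_lhs(x, y, m, n, a, b)
    if abs(value - 1.0) <= tol:
        return "on"
    return "inside" if value < 1.0 else "outside"

# Hypothetical ellipse centered at (2, -1) with a = 5 and b = 3.
m, n, a, b = 2.0, -1.0, 5.0, 3.0
print(locate(m + a, n, m, n, a, b))      # a vertex of the translated ellipse
print(locate(m, n + b, m, n, a, b))      # a co-vertex of the translated ellipse
print(locate(m, n, m, n, a, b))          # the center
print(locate(m + a + 1, n, m, n, a, b))  # one unit beyond the vertex
```

The vertex and co-vertex move with the center, landing at $(m\pm a,n)$ and $(m,n\pm b)$ , which is what the first two calls check.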

Parabola

Parabolas can be seen as a generalization of the graph of a quadratic function. However, the two are essentially different. While a quadratic function describes a relationship between one variable and another, a parabola is a curve in 2D space.

Definition of a parabola

Assume there is a point $F=(p_{1},p_{2})$ and a line $l:y=mx+b$ that does not go through point $F$ . The parabola is a set of points $P=(x,y)$ that satisfies:

$P\in \{(x,y):PF=Pl\}$

To make things simple for the derivation, we put the point $F$ on the $y$ -axis and the vertex of the parabola on the origin (see image on the right), so $F=(0,p)$ . Now, imagine that point $P$ is on the vertex, so $P_{0}=(0,0)$ . Because $P_{0}F=P_{0}l$ and $P_{0}F=p$ , the line $l$ , taken to be perpendicular to the $y$ -axis, lies at distance $p$ from the vertex on the side opposite to $F$ , so its equation is $l:y=-p$ .
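The defining condition $PF=Pl$ can be tested directly on individual points. The sketch below is an illustration under assumptions: the function names are hypothetical, the line is written as $y=mx+b$ with the parameters named `slope` and `intercept`, and the distance from a point to that line uses the standard formula $|mx-y+b|/{\sqrt {m^{2}+1}}$ .

```python
import math

def dist_to_line(p, slope, intercept):
    """Perpendicular distance from p = (x, y) to the line y = slope*x + intercept."""
    x, y = p
    return abs(slope * x - y + intercept) / math.hypot(slope, 1.0)

def on_parabola(p, focus, slope, intercept, tol=1e-9):
    """True if p is equidistant from the focus and the line, within tol."""
    return abs(math.dist(p, focus) - dist_to_line(p, slope, intercept)) <= tol

# Setup from the text with a hypothetical p = 1: F = (0, p) and l: y = -p.
p_value = 1.0
focus = (0.0, p_value)
slope, intercept = 0.0, -p_value
print(on_parabola((0.0, 0.0), focus, slope, intercept))  # the vertex
print(on_parabola((2.0, 1.0), focus, slope, intercept))  # both distances are 2
print(on_parabola((1.0, 1.0), focus, slope, intercept))  # distances are 1 and 2
```

The vertex and the point $(2,1)$ satisfy the condition, while $(1,1)$ does not, since it is closer to $F$ than to $l$ .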

Now, we can start deriving the standard equation for the parabola.

Part of a parabola (blue), with various features (other colours). The complete parabola has no endpoints. In this orientation, it extends infinitely to the left, right, and upward.