# ADiPy, Automatic Differentiation for Python

ADiPy is a fast, pure-python automatic differentiation (AD) library. This package provides the following functionality:

- Arbitrary order univariate differentiation
- First-order multivariate differentiation
- Univariate Taylor polynomial function generator
- Jacobian matrix generator
- Compatible linear algebra routines

## Installation

To install `adipy`

, simply do one of the following in a terminal window
(administrative priviledges may be required):

- Download the tarball, unzip, then run
`python setup.py install`

in the unzipped directory. - Run
`easy_install [--upgrade] adipy`

- Run
`pip install [--upgrade] adipy`

## Where to Start

To start, we use the simple import:

from adipy import *

This imports the necessary constructors and elementary functions (sin, exp,
sqrt, etc.) as well as `np`

which is the root NumPy module.

Now, we can construct AD objects using either `ad(...)`

or `adn(...)`

. For
multivariate operations, it is recommended to construct them all at once using
the `ad(...)`

function, but this is not required. The syntax is only a little
more complicated if they are initialized separately.

## Univariate Examples

Here are some examples of univariate operations:

# A single, first-order differentiable object x = ad(1.5) y = x**2 print y # output is: ad(2.25, array([3.0])) # What is dy/dx? print y.d(1) # output is: 3.0 z = x*sin(x**2) print z # output is: ad(1.1671097953318819, array([-2.0487081053644052])) # What is dz/dx? print z.d(1) # output is: -2.0487081053644052 # A single, fourth-order differentiable object x = adn(1.5, 4) y = x**2 print y # output is: ad(2.25, array([ 3., 2., 0., -0.])) # What is the second derivative of y with respect to x? print y.d(2) # output is: 2.0 z = x*sin(x**2) print z # output is: # ad(1.1671097953318819, array([ -2.04870811, -16.15755076, -20.34396265, 194.11618384])) # What is the fourth derivative of z with respect to x? print z.d(4) # output is: 194.116183837

As can be seen in the examples, when an AD object is printed out, you see two sets of numbers. The first is the nominal value, or the zero-th derivative. The next set of values are the 1st through the Nth order derivatives, evaluated at the nominal value.

## Multivariate Examples

For multivariate sessions, things look a little bit different and can only handle first derivatives (for the time being), but behave similarly:

x = ad(np.array([-1, 2.1, 0.25])) y = x**2 print y # output is: # ad(array([ 1. , 4.41 , 0.0625]), array([[[-2. , 0. , 0. ], # [-0. , 4.2, 0. ], # [-0. , 0. , 0.5]]]))

This essentially just performed the `**2`

operator on each object individually,
so we can see the derivatives for each array index and how they are not
dependent on each other. Using standard indexing operations, we can access the
individual elements of an AD multivariate object:

print x[0] # output is: # ad(-1, array([ 1., 0., 0.]))

What if we want to use more than one AD object in calculations? Let's see what happens:

z = x[0]*sin(x[1]*x[2]) print z # output is: # ad(-0.50121300467379792, array([[ 0.501213 , -0.21633099, -1.81718028]]))

The result here shows both the nominal value for z, but also the partial derivatives for each of the x values. Thus, dz/dx[0] = 0.501213, etc.

## Jacobian

If we have multiple outputs, like:

y = [0]*2 y[0] = x[0]*x[1]/x[2] y[1] = -x[2]**x[0]

we can use the `jacobian`

function to summarize the partial derivatives for
each index of y:

print jacobian(y) # output is: [[ 8.4 -4. 33.6 ] # [ 5.54517744 0. 16. ]]

Just as before, we can extract the first partial derivatives:

print z.d(1) # output is: [ 0.501213 -0.21633099 -1.81718028]

For the object y, we can't yet use the `d(...)`

function yet, because it is
technically a list at this point. However, we can convert it to a single,
multivariate AD object using the `unite`

function, then we'll have access
to the `d(...)`

function. The `jacobian`

function's result is the same in
both cases:

y = unite(y) print y.d(1) # output is: [[ 8.4 -4. 33.6 ] # [ 5.54517744 0. 16. ]] print jacobian(y) # output is: [[ 8.4 -4. 33.6 ] # [ 5.54517744 0. 16. ]]

Like was mentioned before, multivariate sessions can initialize individual independent AD objects, though not quite as conveniently as before, using the following syntax:

x = ad(-1, np.array([1, 0, 0])) y = ad(2.1, np.array([0, 1, 0])) z = ad(0.25, np.array([0, 0, 1]))

This allows all the partial derivatives to be tracked, noted at the respective unitary index at initialization. Conversely to singular construction, we can break-out the individual elements, if desired:

x, y, z = ad([np.array([-1, 2.1, 0.25]))

And the results are the same.

## Univariate Taylor Series Approximation

For univariate functions, we can use the `taylorfunc`

function to generate
an callable function that allows approximation to some specifiable order:

x = adn(1.5, 6) # a sixth-order AD object z = x*sin(x**2) fz = taylorfunc(z, at=x.nom)

The "at" keyword designates the point that the series is expanded about, which
will likely always be at the nominal value of the original independent AD
object (e.g., `x.nom`

). Now, we can use `fz`

whenever we need to
approximate `x*sin(x**2)`

, but know that the farther it is evaluated from
`x.nom`

, the more error there will be in the approximation.

If Matplotlib is installed, we can see the difference in the order of the approximating Taylor polynomials:

import matplotlib.pyplot as plt xAD = [adn(1.5, i) for i in xrange(1, 7)] # a list of ith-order AD objects def z(x): return x*sin(x**2) x = np.linspace(0.75, 2.25) plt.plot(x, z(x), label='Actual Function') for i in xrange(len(xAD)): fz = taylorfunc(z(xAD[i]), at=xAD[i].nom) plt.plot(x, fz(x), label='Order %d Taylor'%(i+1)) plt.legend(loc=0) plt.show()

Notice that at x=1.5, all the approximations are perfectly accurate (as we would expect) and error increases as the approximation moves farther from that point, but less so with the increase in the order of the approximation.

## Linear Algebra

Several linear algebra routines are available that are AD-compatible:

- Decompositions
- Cholesky (
`chol`

) - QR (
`qr`

) - LU (
`lu`

)

- Cholesky (
- Linear System solvers
- General solver, with support for multiple outputs (
`solve`

) - Least squares solver (
`lstsq`

) - Matrix inverse (
`inv`

)

- General solver, with support for multiple outputs (
- Matrix Norms
- Frobenius norm, or 2-norm (
`norm`

)

- Frobenius norm, or 2-norm (

These require a separate import `import adipy.linalg`

, then they can be
using something like `adipy.linalg.solve(...)`

.

See the source code for relevant documentation and examples. If you are familiar with NumPy's versions, you will find these easy to use.

## Support

Please contact the author with any questions, comments, or good examples of
how you've used `adipy`

!

## License

This package is distributed under the BSD License. It is free for public and commercial use and may be copied royalty free, provided the author is given credit.