Skip to content

Working with External Libraries in Python

Working with External Libraries in Python Cover Image

In this tutorial, you’ll learn how to work with external libraries in Python. By the end of this tutorial, you’ll have learned why external libraries are essential to your Python programming journey, and how to install and import external libraries. You’ll also gain an understanding of how external libraries work.

What are External Libraries in Python?

So far, you’ve learned how to use the built-in functions and objects that come with Python. One of the biggest perks of using Python is the availability of vast numbers of libraries. These libraries are incredibly diverse and of high quality and make data science much, much easier in Python.

An external library is a custom set of functions, objects, and more that were written to eliminate having to write code from scratch. There are hundreds of thousands of external libraries with a vast variety of abilities that they provide.

Some of these libraries are part of the standard library. What this means is that they are installed with Python and do not need to be installed. The benefit of this is that all Python users will have the same libraries available to them. To learn more about the standard library, check out the official documentation.

Other libraries need to be installed. These libraries aren’t part of the standard library, but can provide many additional features. Later in this tutorial, you’ll learn how to install libraries using different tools.

Why Use External Libraries in Python?

External libraries in Python open up significant potential in data science. Many data science libraries open up significant opportunities for data analysis, data visualization, and for machine learning. Rather than writing all your functions from scratch, you can lean on the hard work of others.

Python developers make many of their tools available for free. This gives you the opportunity to produce data analysis much quicker and reliably than coding everything yourself.

What’s more is that these libraries are often extensively tested and maintained by groups of developers. This means that these libraries are often very robust, providing incredibly easy ways to make your day to day coding experience significantly easier.

How to Import an External Library in Python

Let’s take a look at how we can import an external library in Python. In this section, we’ll start off by using a library that’s part of the standard library, meaning that you don’t need to worry about installing it.

The example that we’ll use for this section is the statistics library. The statistics library provides a number of functions for calculating, well, statistics of numeric data.

In order to import a library, we use the import keyword. Let’s see how we can import the statistics library.

# Importing a library
import statistics

It’s as easy as that! We now have access to all the functions and objects available in the statistics library. Let’s take a look at what happens when we check the type of the library using the type() function.

# Checking the type of a library
import statistics
print(type(statistics))

# Returns: <class 'module'>

We can see that the type is a 'module', which is an interchangeable term of library.

Import Only a Single Function in Python

One of the functions available in the statistics library is the median() function. In order to actually use the function, you need to prefix the function with the library name. If you’re only planning on using a single function from the library, you can import only that function.

Let’s see how this works:

# Importing only a single function
from statistics import median

# Allows you to use the median() function without writing statistics.median()

Importing a Python Library with an Alias

You can also change the name of a library as you’re importing it by using an alias. This allows you to use an abbreviation for different libraries. You’ll see this used very frequently when we start working with other libraries like numpy, pandas and seaborn.

In order to use an alias, we use the as keyword. Let’s import the statistics library using the alias st.

# Importing a library using an alias
import statistics as st

What should you keep in mind when using an alias?

When importing libraries it’s important to follow conventions! It may seem fun to give your imported libraries a cheeky alias, but using conventional aliases, or no aliases at all (when that’s convention) allows you to troubleshoot much quicker!

How to Install External Libraries in Python

There may be many times when you need to install an external library. While Python comes with many excellent modules as part of its standard library, there are many libraries that you may wish to install. Later in this tutorial series, you’ll make extensive use of libraries such as pandas and seaborn in order to work with tabular data and to produce beautiful data visualizations.

In order to install these modules, we can make use of the pip terminal command. pip is the package installer for Python and allows us to install modules from the Python Package Index.

Typically pip will come installed when you install Python. That being said, you may not actually have it installed. In order to check whether or not it’s installed or not, run the following in either your command line or terminal:

python3 -m pip --version
python -m pip --version

What if you don’t have pip installed?

If you end up not having Python installed, you can install it directly by downloading it from bootstrap. Download the file linked here and then run it by running python get-pip.py in your terminal from the location where you downloaded the file.

Once you have pip up and running, you can install external libraries which aren’t part of standard library using pip in the terminal.

For example, to install the pandas library, you can simply run the following in your terminal or command line:

pip install pandas

pip will handle installing the library by downloading any requirements for it, such as other required dependencies. Feel free to install pandas now or wait until we make it to that section of the course!

How to Use External Libraries in Python

Now that you know how to import and install external libraries, let’s take a look at how to actually use them. We’ll take a closer look at the statistics library again. Let’s begin by importing the library again.

# Importing the statistics library
import statistics

We can now access all the different functions available in the library. But, how do you actually know what functions are now available to you? For this, we can use the dir() function, which will return all the different functions and objects available:

# Using the dir() function to see available functions
import statistics
print(dir(statistics))

# Returns: ['Counter', 'Decimal', 'Fraction', 'NormalDist', 'StatisticsError', '__all__', '__builtins__', '__cached__', '__doc__', '__file__', '__loader__', '__name__', '__package__', '__spec__', '_coerce', '_convert', '_exact_ratio', '_fail_neg', '_find_lteq', '_find_rteq', '_isfinite', '_normal_dist_inv_cdf', '_ss', '_sum', 'bisect_left', 'bisect_right', 'erf', 'exp', 'fabs', 'fmean', 'fsum', 'geometric_mean', 'groupby', 'harmonic_mean', 'hypot', 'itemgetter', 'log', 'math', 'mean', 'median', 'median_grouped', 'median_high', 'median_low', 'mode', 'multimode', 'numbers', 'pstdev', 'pvariance', 'quantiles', 'random', 'sqrt', 'stdev', 'tau', 'variance']

One of these functions is the median() function. The median is the middle value, or a value that separates the higher half of a distribution from the lower half.

In order to access the median() function, you need to access the function from the library. It’s not enough to simply call median(). We need to call statistics.median().

How can you learn how to use the median() function? You can use the help() function in order to gain access to its docstrings:

# Getting help!
import statistics

print(help(statistics.median)

# Returns:
# Median(data)
#     Return the median (middle value) of numeric data.
    
#     When the number of data points is odd, return the middle data point.
#     When the number of data points is even, the median is interpolated by
#     taking the average of the two middle values:
    
#     >>> median([1, 3, 5])
#     3
#     >>> median([1, 3, 5, 7])
#     4.0

We can see from the docstring how the method works. We can pass in a list of numbers to return the middle value.

This is great! We don’t need to develop or test a function ourselves, but we can simply import the function!

Again, Why use Python Libraries?

Now that you have a much stronger understanding of Python libraries, let’s take another look at how much benefit they give us. We’ll take a look at calculating the standard deviation from scratch and then take a look at how to do this using a built-in library.

As a quick refresher, the standard deviation measures the amount of dispersion in a normally distributed set of data. It’s calculated using the following formula:

σ = √Σ (xi – μ)2 / (n-1)

Let’s take a look at how we can calculate this using Python. Don’t worry if some of these pieces don’t yet make sense. If you are interested in doing a deep dive into this, check out this in-depth tutorial on the standard deviation in Python.

# Calculating a Standard Deviation from Scratch
sample_list = [170,155,160,185,145]

# Finding Mean value
sums = 0
for value in sample_list:
    sums += value

mean = sums / len(sample_list)

# Finding square of difference of mean and each value
difference_squared = 0
for j in sample_list:
    difference_squared += (j - mean) ** 2

# Finding Square Root
standard_deviation = (difference_squared / ((len(sample_list)) - 1)) ** 0.5

print(standard_deviation)
# Returns 15.25

This is a lot of code! There’s a lot of overhead and complexity in this that’s probably unnecessary. We could, put simply, just use the stdev function from the statistics library. Let’s see how much easier that is!

from statistics import stdev
sample_list = [170,155,160,185,145]

print(stdev(sample_list))
# Returns 15.25

Isn’t that much, much easier? In the next section, you can complete a couple of checks to see solidify your understanding.

Exercises

Use the sections below to check your understanding of external libraries in Python:

What does it mean for a module to be part of the standard library?

Modules that are part of the standard library come installed with Python. This means that you don’t need to check if the library is installed and that any user of Python (with the same version of Python) will have access to these libraries.

How would you import the math library?

import math

How can you find out what functions a library has?

Using the dir() function.

How would you install the pandas library?

pip install pandas

Conclusion and Recap

In this tutorial, you learned what external libraries are and how to use them in Python. You learned how to work with modules that are part of the standard library and how to work with ones you need to install. Some of the highlights of using external libraries are:

  • Libraries simplify your code by allowing us to use pre-made functions and objects from other developers
  • We can use the pip tool to install external libraries
  • When using external libraries, we can import the entire library or only some functions
  • You can use the dir() function to get an understanding of what functions are available in the library
  • You can access information about different functions using the help() function

Additional Resources

To learn more about related topics, check out these tutorials:

Nik Piepenbreier

Nik is the author of datagy.io and has over a decade of experience working with data analytics, data science, and Python. He specializes in teaching developers how to use Python for data science using hands-on tutorials.View Author posts

6 thoughts on “Working with External Libraries in Python”

  1. Hi Nik,
    Thanks for this great tutorial series. I admire your passion for teaching.

    I noticed that in the section ‘Again, Why use Python Libraries?’ you state ‘If you are interested in doing a deep dive into this, check out this in-depth tutorial on the standard deviation in Python.’ However, there is no link to a standard deviation tutorial. I would be interested to see that tutorial.

    1. Hi John,

      Sorry for the late response.

      Thanks so much for your comment and for catching that! I have updated the article with the link to the standard deviation tutorial :).

      Have a great day!

  2. Hi Nik! Loved this lesson, thanks, just a little correction:

    In the getting help part, there is an issue with the code that may prevent it from executing properly. Specifically, the help() function is meant to be called on a function or module object, but in this case, it is being called on the return value of statistics.median(). Since statistics.median() returns a value, not an object, calling help(statistics.median()) will generate a TypeError because the help() function does not accept an argument of type int or float.

    To fix this issue, you can simply remove the parentheses after median so that help() is called on the function object itself, rather than its return value, like this:

    import statistics
    print(help(statistics.median))

Leave a Reply

Your email address will not be published. Required fields are marked *