In this tutorial, you’ll learn how to work with external libraries in Python. By the end of this tutorial, you’ll have learned why external libraries are essential to your Python programming journey, and how to install and import external libraries. You’ll also gain an understanding of how external libraries work.
What are External Libraries in Python?
So far, you’ve learned how to use the built-in functions and objects that come with Python. One of the biggest perks of using Python is the availability of vast numbers of libraries. These libraries are incredibly diverse and of high quality and make data science much, much easier in Python.
An external library is a custom set of functions, objects, and more that were written to eliminate having to write code from scratch. There are hundreds of thousands of external libraries with a vast variety of abilities that they provide.
Some of these libraries are part of the standard library. What this means is that they are installed with Python and do not need to be installed. The benefit of this is that all Python users will have the same libraries available to them. To learn more about the standard library, check out the official documentation.
Other libraries need to be installed. These libraries aren’t part of the standard library, but can provide many additional features. Later in this tutorial, you’ll learn how to install libraries using different tools.
Why Use External Libraries in Python?
External libraries in Python open up significant potential in data science. Many data science libraries open up significant opportunities for data analysis, data visualization, and for machine learning. Rather than writing all your functions from scratch, you can lean on the hard work of others.
Python developers make many of their tools available for free. This gives you the opportunity to produce data analysis much quicker and reliably than coding everything yourself.
What’s more is that these libraries are often extensively tested and maintained by groups of developers. This means that these libraries are often very robust, providing incredibly easy ways to make your day to day coding experience significantly easier.
How to Import an External Library in Python
Let’s take a look at how we can import an external library in Python. In this section, we’ll start off by using a library that’s part of the standard library, meaning that you don’t need to worry about installing it.
The example that we’ll use for this section is the
statistics library. The statistics library provides a number of functions for calculating, well, statistics of numeric data.
In order to import a library, we use the
import keyword. Let’s see how we can import the
# Importing a library import statistics
It’s as easy as that! We now have access to all the functions and objects available in the statistics library. Let’s take a look at what happens when we check the type of the library using the
# Checking the type of a library import statistics print(type(statistics)) # Returns: <class 'module'>
We can see that the type is a
'module', which is an interchangeable term of library.
Import Only a Single Function in Python
One of the functions available in the statistics library is the
median() function. In order to actually use the function, you need to prefix the function with the library name. If you’re only planning on using a single function from the library, you can import only that function.
Let’s see how this works:
# Importing only a single function from statistics import median # Allows you to use the median() function without writing statistics.median()
Importing a Python Library with an Alias
You can also change the name of a library as you’re importing it by using an alias. This allows you to use an abbreviation for different libraries. You’ll see this used very frequently when we start working with other libraries like numpy, pandas and seaborn.
In order to use an alias, we use the
as keyword. Let’s import the statistics library using the alias
# Importing a library using an alias import statistics as st
What should you keep in mind when using an alias?
When importing libraries it’s important to follow conventions! It may seem fun to give your imported libraries a cheeky alias, but using conventional aliases, or no aliases at all (when that’s convention) allows you to troubleshoot much quicker!
How to Install External Libraries in Python
There may be many times when you need to install an external library. While Python comes with many excellent modules as part of its standard library, there are many libraries that you may wish to install. Later in this tutorial series, you’ll make extensive use of libraries such as
seaborn in order to work with tabular data and to produce beautiful data visualizations.
In order to install these modules, we can make use of the
pip terminal command.
pip is the package installer for Python and allows us to install modules from the Python Package Index.
pip will come installed when you install Python. That being said, you may not actually have it installed. In order to check whether or not it’s installed or not, run the following in either your command line or terminal:
What if you don’t have pip installed?
If you end up not having Python installed, you can install it directly by downloading it from bootstrap. Download the file linked here and then run it by running
python get-pip.py in your terminal from the location where you downloaded the file.
Once you have pip up and running, you can install external libraries which aren’t part of standard library using pip in the terminal.
For example, to install the
pandas library, you can simply run the following in your terminal or command line:
pip install pandas
pip will handle installing the library by downloading any requirements for it, such as other required dependencies. Feel free to install
pandas now or wait until we make it to that section of the course!
How to Use External Libraries in Python
Now that you know how to import and install external libraries, let’s take a look at how to actually use them. We’ll take a closer look at the
statistics library again. Let’s begin by importing the library again.
# Importing the statistics library import statistics
We can now access all the different functions available in the library. But, how do you actually know what functions are now available to you? For this, we can use the
dir() function, which will return all the different functions and objects available:
# Using the dir() function to see available functions import statistics print(dir(statistics)) # Returns: ['Counter', 'Decimal', 'Fraction', 'NormalDist', 'StatisticsError', '__all__', '__builtins__', '__cached__', '__doc__', '__file__', '__loader__', '__name__', '__package__', '__spec__', '_coerce', '_convert', '_exact_ratio', '_fail_neg', '_find_lteq', '_find_rteq', '_isfinite', '_normal_dist_inv_cdf', '_ss', '_sum', 'bisect_left', 'bisect_right', 'erf', 'exp', 'fabs', 'fmean', 'fsum', 'geometric_mean', 'groupby', 'harmonic_mean', 'hypot', 'itemgetter', 'log', 'math', 'mean', 'median', 'median_grouped', 'median_high', 'median_low', 'mode', 'multimode', 'numbers', 'pstdev', 'pvariance', 'quantiles', 'random', 'sqrt', 'stdev', 'tau', 'variance']
One of these functions is the
median() function. The median is the middle value, or a value that separates the higher half of a distribution from the lower half.
In order to access the
median() function, you need to access the function from the library. It’s not enough to simply call
median(). We need to call
How can you learn how to use the
median() function? You can use the
help() function in order to gain access to its docstrings:
# Getting help! import statistics print(help(statistics.median())) # Returns: # Median(data) # Return the median (middle value) of numeric data. # When the number of data points is odd, return the middle data point. # When the number of data points is even, the median is interpolated by # taking the average of the two middle values: # >>> median([1, 3, 5]) # 3 # >>> median([1, 3, 5, 7]) # 4.0
We can see from the docstring how the method works. We can pass in a list of numbers to return the middle value.
This is great! We don’t need to develop or test a function ourselves, but we can simply import the function!
Again, Why use Python Libraries?
Now that you have a much stronger understanding of Python libraries, let’s take another look at how much benefit they give us. We’ll take a look at calculating the standard deviation from scratch and then take a look at how to do this using a built-in library.
As a quick refresher, the standard deviation measures the amount of dispersion in a normally distributed set of data. It’s calculated using the following formula:
σ = √Σ (xi – μ)2 / (n-1)
Let’s take a look at how we can calculate this using Python. Don’t worry if some of these pieces don’t yet make sense. If you are interested in doing a deep dive into this, check out this in-depth tutorial on the standard deviation in Python.
# Calculating a Standard Deviation from Scratch sample_list = [170,155,160,185,145] # Finding Mean value sums = 0 for value in sample_list: sums += value mean = sums / len(sample_list) # Finding square of difference of mean and each value difference_squared = 0 for j in sample_list: difference_squared += (j - mean) ** 2 # Finding Square Root standard_deviation = (difference_squared / ((len(sample_list)) - 1)) ** 0.5 print(standard_deviation) # Returns 15.25
This is a lot of code! There’s a lot of overhead and complexity in this that’s probably unnecessary. We could, put simply, just use the
stdev function from the statistics library. Let’s see how much easier that is!
from statistics import stdev sample_list = [170,155,160,185,145] print(stdev(sample_list)) # Returns 15.25
Isn’t that much, much easier? In the next section, you can complete a couple of checks to see solidify your understanding.
Use the sections below to check your understanding of external libraries in Python:
What does it mean for a module to be part of the standard library?
Modules that are part of the standard library come installed with Python. This means that you don’t need to check if the library is installed and that any user of Python (with the same version of Python) will have access to these libraries.
How would you import the math library?
How can you find out what functions a library has?
How would you install the pandas library?
pip install pandas
Conclusion and Recap
In this tutorial, you learned what external libraries are and how to use them in Python. You learned how to work with modules that are part of the standard library and how to work with ones you need to install. Some of the highlights of using external libraries are:
- Libraries simplify your code by allowing us to use pre-made functions and objects from other developers
- We can use the
piptool to install external libraries
- When using external libraries, we can import the entire library or only some functions
- You can use the
dir()function to get an understanding of what functions are available in the library
- You can access information about different functions using the
To learn more about related topics, check out these tutorials: