Home Python C Language C ++ HTML 5 CSS Javascript Java Kotlin SQL DJango Bootstrap React.js R C# PHP ASP.Net Numpy Dart Pandas Digital Marketing

Seaborn for Statistical Plots in Python


Seaborn is a powerful Python visualization library built on top of Matplotlib. It provides a high-level interface for creating visually appealing and informative statistical plots. Seaborn simplifies the process of creating complex statistical plots with just a few lines of code.

Installing Seaborn

If Seaborn is not already installed, you can install it using the following pip command:

    pip install seaborn
        

Importing Seaborn

Once Seaborn is installed, you can import it along with Matplotlib for plotting:

    import seaborn as sns
    import matplotlib.pyplot as plt
        

Creating a Simple Distribution Plot

One of the most common plots in Seaborn is the distribution plot. It is used to visualize the distribution of a dataset. The sns.distplot() function can be used to create this plot. Here’s an example:

    # Importing Seaborn
    import seaborn as sns
    import matplotlib.pyplot as plt

    # Creating a sample dataset
    data = [12, 15, 14, 10, 17, 18, 14, 16, 13, 19, 20, 22, 25, 18, 17]

    # Creating a distribution plot
    sns.histplot(data, kde=True)

    # Adding a title and displaying the plot
    plt.title("Distribution Plot with KDE")
    plt.show()
        

This example creates a distribution plot with the optional Kernel Density Estimate (KDE), which smooths out the histogram into a continuous curve.

Box Plot

A box plot is a great way to visualize the distribution of a dataset and identify outliers. Seaborn provides the sns.boxplot() function for creating box plots. Here’s an example:

    # Creating a box plot
    data = [12, 15, 14, 10, 17, 18, 14, 16, 13, 19, 20, 22, 25, 18, 17]
    sns.boxplot(data=data)

    # Adding a title and displaying the plot
    plt.title("Box Plot")
    plt.show()
        

In this example, a box plot is created to visualize the spread of data and identify any outliers. The plot displays the median, quartiles, and outliers of the dataset.

Violin Plot

A violin plot combines aspects of both box plots and kernel density plots. It displays the distribution of the data, as well as its probability density. Seaborn’s sns.violinplot() function makes it easy to create violin plots:

    # Creating a violin plot
    sns.violinplot(data=data)

    # Adding a title and displaying the plot
    plt.title("Violin Plot")
    plt.show()
        

The violin plot shows the distribution of the dataset along with its density, which is helpful for understanding the distribution in more detail.

Pair Plot

When you have multiple variables and want to explore relationships between them, a pair plot is useful. Seaborn’s sns.pairplot() function creates a grid of subplots that visualize pairwise relationships in a dataset.

    # Importing a sample dataset
    import seaborn as sns
    iris = sns.load_dataset("iris")

    # Creating a pair plot
    sns.pairplot(iris, hue="species")

    # Adding a title and displaying the plot
    plt.title("Pair Plot of Iris Dataset")
    plt.show()
        

The pair plot shows relationships between all numeric variables in the dataset, and the points are colored by the 'species' column to differentiate between different species.

Heatmap

A heatmap is a great way to visualize data in matrix form. It uses color to represent values in the matrix. Seaborn’s sns.heatmap() function is used for creating heatmaps. Here's an example:

    # Creating a heatmap from a correlation matrix
    import numpy as np
    data = np.random.rand(5, 5)

    # Creating a heatmap
    sns.heatmap(data, annot=True)

    # Adding a title and displaying the plot
    plt.title("Heatmap Example")
    plt.show()
        

In this example, we create a 5x5 matrix of random numbers and visualize it as a heatmap. The annot=True option adds the numerical values to each cell of the heatmap.

Conclusion

Seaborn is a powerful tool for creating statistical plots in Python. It provides a simple interface for creating a wide variety of visualizations, including distribution plots, box plots, violin plots, pair plots, and heatmaps. By using Seaborn, you can gain deeper insights into your data through clear and concise visual representations.



Advertisement





Q3 Schools : India


Online Complier

HTML 5

Python

java

C++

C

JavaScript

Website Development

HTML

CSS

JavaScript

Python

SQL

Campus Learning

C

C#

java