posted on 2023-05-21 17:53 read(410) comment(0) like(27) collect(2)
Chapter 1 draws a line chart
using matplotlib Chapter 2 draws a bar chart
using matplotlib Chapter 3 draws a histogram using matplotlib
Chapter 4 draws a scatter chart using matplotlib
Chapter 5 draws a pie chart using matplotlib
Chapter 6 draws a heat map using matplotlib
Chapter 7 uses matplotlib to draw stacked bar charts
Chapter 8 uses matplotlib to draw multiple graphs in one canvas
As the saying goes, a picture is worth a thousand words. Data visualization is to display data in a graphical way, which makes it easier for us to observe the laws contained in the data. After gaining insight into the laws contained in the data, we can make better business decisions.
A line chart is a statistical chart composed of points and lines, often used to represent changes in values over continuous time intervals or ordered categories. In a line chart, the x-axis is often used as a continuous time interval or as an ordered category (such as stage 1, stage 2, stage 3). The y-axis is used for quantified data, and if it is negative, it will be drawn below the y-axis. Links are used to connect two adjacent data points.
Line charts are used to analyze trends in things over time or in ordered categories. If there are multiple sets of data, it is used to analyze the interaction and influence of multiple sets of data over time or ordered categories. The direction of the broken line indicates positive/negative change. The slope of the broken line indicates the degree of change.
In this lesson we will see how to make a line chart. First we need to install the matplotlib library, which can pip install matplotlib
be . After the matplotlib library is installed, we can use the matplotlib library to draw. For example:
from matplotlib import pyplot as plt
dev_x = [25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35]
dev_y = [38496, 42000, 46752, 49320, 53200, 56000, 62316, 64928, 67317, 68748, 73752]
plt.plot(dev_x, dev_y)
plt.show()
The first line of the above code imports pyplot from matplotlib and aliases plt. dev_x、dev_y
They are two lists, each containing 11 data, and then call the plot function to draw the graph. The parameters of the plot function are two sets of data. After calling the plot function, you need to call the show function, otherwise the graph will not be displayed. The graph obtained by the above code is as follows:
The meanings represented by the abscissa and ordinate and the title of the entire graph are not identified in the above graph. Let's add the above information:
from matplotlib import pyplot as plt
dev_x = [25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35]
dev_y = [38496, 42000, 46752, 49320, 53200, 56000, 62316, 64928, 67317, 68748, 73752]
plt.plot(dev_x, dev_y)
plt.xlabel("年龄")
plt.ylabel("年薪")
plt.title("年龄和薪水的关系")
plt.show()
The above code adds the meaning represented by the abscissa through the xlabel function, adds the meaning represented by the ordinate through the ylabel function, and adds the title of the graph through the title function. The graph after adding the above information is shown in the figure below:
the graph above contains only one polyline, now let’s add another curve. code show as below:
from matplotlib import pyplot as plt
dev_x = [25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35]
dev_y = [38496, 42000, 46752, 49320, 53200, 56000, 62316, 64928, 67317, 68748, 73752]
plt.plot(dev_x, dev_y)
py_dev_x = [25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35]
py_dev_y = [45372, 48876, 53850, 57287, 63016, 65998, 70003, 70000, 71496, 75370, 83640]
plt.plot(py_dev_x, py_dev_y)
plt.xlabel("年龄")
plt.ylabel("年薪")
plt.title("年龄和薪水的关系")
plt.show()
The graph drawn is as follows:
In the above code, we added a polyline through another set of data. Looking at the code carefully, we can find that dev_x
and py_dev_x
are the same. For the sake of code brevity, we can only keep one copy, for example:
from matplotlib import pyplot as plt
ages_x = [25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35]
dev_y = [38496, 42000, 46752, 49320, 53200, 56000, 62316, 64928, 67317, 68748, 73752]
plt.plot(ages_x, dev_y)
py_dev_y = [45372, 48876, 53850, 57287, 63016, 65998, 70003, 70000, 71496, 75370, 83640]
plt.plot(ages_x, py_dev_y)
plt.xlabel("年龄")
plt.ylabel("年薪")
plt.title("年龄和薪水的关系")
plt.show()
The graph generated after executing the above code is shown in the figure below:
Since there are two broken lines in the graph, the meaning of each broken line is not marked in the graph, so we cannot distinguish the two broken lines. In order to distinguish the two broken lines To distinguish, we need to label the two polylines separately. For example:
from matplotlib import pyplot as plt ages_x = [25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35] dev_y = [38496, 42000, 46752, 49320, 53200, 56000, 62316, 64928, 67317, 68748, 73752] plt.plot(ages_x, dev_y) py_dev_y = [45372, 48876, 53850, 57287, 63016, 65998, 70003, 70000, 71496, 75370, 83640] plt.plot(ages_x, py_dev_y) plt.xlabel("年龄") plt.ylabel("年薪") plt.title("年龄和薪水的关系") plt.legend(['全部开发者','Python开发者']) plt.show()
The graph drawn is as follows:
In the above code, we use the legend function to label the two polylines. The parameter of the legend function is a list, and the order of the elements in the list must be consistent with the drawing order of the polylines, otherwise it will cause confusion. When we change the drawing order of the polylines, we need to change the order of the elements in the list that is the parameter of the legend function simultaneously. This is the defect of the above method, let's look at another method.
from matplotlib import pyplot as plt ages_x = [25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35] dev_y = [38496, 42000, 46752, 49320, 53200, 56000, 62316, 64928, 67317, 68748, 73752] plt.plot(ages_x, dev_y, label="全部开发者") py_dev_y = [45372, 48876, 53850, 57287, 63016, 65998, 70003, 70000, 71496, 75370, 83640] plt.plot(ages_x, py_dev_y, label="Python开发者") plt.xlabel("年龄") plt.ylabel("年薪") plt.title("年龄和薪水的关系") plt.legend() plt.show()
The above method adds the label by adding the parameter label in the plot function. After adding the parameter label, we also need to call the legend function, otherwise the label will not be displayed, but at this time calling the legend function does not need to pass in parameters. The graphic after adding labels to the polyline is shown in the following figure:
The styles of the above graphics are default, and we can also set the style of the graphics through parameters, including the color, shape and thickness of the polyline, etc. For example:
from matplotlib import pyplot as plt ages_x = [25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35] dev_y = [38496, 42000, 46752, 49320, 53200, 56000, 62316, 64928, 67317, 68748, 73752] plt.plot(ages_x, dev_y, label="全部开发者", color="blue", marker=".", linestyle="-") py_dev_y = [45372, 48876, 53850, 57287, 63016, 65998, 70003, 70000, 71496, 75370, 83640] plt.plot(ages_x, py_dev_y, label="Python开发者", color="green", marker=".", linestyle="--") plt.xlabel("年龄") plt.ylabel("年薪") plt.title("年龄和薪水的关系") plt.legend() plt.show()
In the above code, we specify the color of the graph through the color parameter, the marker parameter specifies the mark of each data point, and the linestyle parameter specifies the shape of the graph. The graphic after executing the above code is shown in the figure below:
The value of the color parameter can also be a Hex value, and the Hex value is composed of six hexadecimal numbers, the first two hexadecimal numbers represent the intensity of red, and the middle two hexadecimal numbers represent the intensity of red. One-digit hexadecimal number represents the intensity of green, and the last two hexadecimal digits represent the intensity of blue. For example:
from matplotlib import pyplot as plt ages_x = [25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35] dev_y = [38496, 42000, 46752, 49320, 53200, 56000, 62316, 64928, 67317, 68748, 73752] plt.plot(ages_x, dev_y, label="全部开发者", color="#FF0000", marker=".", linestyle="-") py_dev_y = [45372, 48876, 53850, 57287, 63016, 65998, 70003, 70000, 71496, 75370, 83640] plt.plot(ages_x, py_dev_y, label="Python开发者", color="#00FF00", marker=".", linestyle="--") plt.xlabel("年龄") plt.ylabel("年薪") plt.title("年龄和薪水的关系") plt.legend() plt.show()
In the above code, the color #FF0000
specified is red, and #00FF00
the color specified by the Hex value is green. The graph after executing the above code is shown in the figure below:
we can also add grid lines to the graph, for example:
from matplotlib import pyplot as plt ages_x = [25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35] dev_y = [38496, 42000, 46752, 49320, 53200, 56000, 62316, 64928, 67317, 68748, 73752] plt.plot(ages_x, dev_y, label="全部开发者", color="#FF0000", marker=".", linestyle="-") py_dev_y = [45372, 48876, 53850, 57287, 63016, 65998, 70003, 70000, 71496, 75370, 83640] plt.plot(ages_x, py_dev_y, label="Python开发者", color="#00FF00", marker=".", linestyle="--") plt.xlabel("年龄") plt.ylabel("年薪") plt.title("年龄和薪水的关系") plt.legend() plt.grid(True) plt.show()
In the above code, call the grid function and pass in the True parameter. After executing the above code, the graph is shown in the figure below:
In order to make the blank area in the graph less, we call the tight_layout function to automatically fill the blank area, for example:
from matplotlib import pyplot as plt ages_x = [25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35] dev_y = [38496, 42000, 46752, 49320, 53200, 56000, 62316, 64928, 67317, 68748, 73752] plt.plot(ages_x, dev_y, label="全部开发者", color="#FF0000", marker=".", linestyle="-") py_dev_y = [45372, 48876, 53850, 57287, 63016, 65998, 70003, 70000, 71496, 75370, 83640] plt.plot(ages_x, py_dev_y, label="Python开发者", color="#00FF00", marker=".", linestyle="--") plt.xlabel("年龄") plt.ylabel("年薪") plt.title("年龄和薪水的关系") plt.legend() plt.grid(True) plt.tight_layout() plt.show()
After adding the call of the tight_layout function, the generated graphics are shown in the figure below:
Above we change the style of the graphics by setting parameters, and pyplot also comes with some styles for us to use directly, for example:
from matplotlib import pyplot as plt plt.style.use('fivethirtyeight') ages_x = [25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35] dev_y = [38496, 42000, 46752, 49320, 53200, 56000, 62316, 64928, 67317, 68748, 73752] plt.plot(ages_x, dev_y, label="全部开发者") py_dev_y = [45372, 48876, 53850, 57287, 63016, 65998, 70003, 70000, 71496, 75370, 83640] plt.plot(ages_x, py_dev_y, label="Python开发者") plt.xlabel("年龄") plt.ylabel("年薪") plt.title("年龄和薪水的关系") plt.legend() plt.grid(True) plt.tight_layout() plt.show()
The above code uses the fivethirtyeight style, and the generated graphics are as shown in the figure below:
There are other styles, such as ggplot:
from matplotlib import pyplot as plt plt.style.use('ggplot') ages_x = [25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35] dev_y = [38496, 42000, 46752, 49320, 53200, 56000, 62316, 64928, 67317, 68748, 73752] plt.plot(ages_x, dev_y, label="全部开发者") py_dev_y = [45372, 48876, 53850, 57287, 63016, 65998, 70003, 70000, 71496, 75370, 83640] plt.plot(ages_x, py_dev_y, label="Python开发者") plt.xlabel("年龄") plt.ylabel("年薪") plt.title("年龄和薪水的关系") plt.legend() plt.grid(True) plt.tight_layout() plt.show()
The resulting graph is shown below:
Trends in the same variable over time or ordinal categories, such as the trend in salary over age in the example above.
In this chapter, we introduce the drawing of the line graph, including the setting of the graphic style and the applicable and unapplicable scenarios of the line graph.
Next Chapter Drawing Bar Charts with matplotlib
Author:Disheartened
link:http://www.pythonblackhole.com/blog/article/25318/21f9c6fd046dfe39465d/
source:python black hole net
Please indicate the source for any form of reprinting. If any infringement is discovered, it will be held legally responsible.
name:
Comment content: (supports up to 255 characters)
Copyright © 2018-2021 python black hole network All Rights Reserved All rights reserved, and all rights reserved.京ICP备18063182号-7
For complaints and reports, and advertising cooperation, please contact vgs_info@163.com or QQ3083709327
Disclaimer: All articles on the website are uploaded by users and are only for readers' learning and communication use, and commercial use is prohibited. If the article involves pornography, reactionary, infringement and other illegal information, please report it to us and we will delete it immediately after verification!