Python + bi crawls 3000 pieces of cheriki data and discovers these secrets

Soft sails 2021-02-23 17:07:24
python bi crawls pieces cheriki

It's said that the price of cherizi has dropped sharply recently , A lot of people dreamed of it before “ Chelizi is free ”, Now it's all possible . In fact, the price of cheriki has dropped , The main reason is that the cost of import freight is greatly reduced , In order to find the best way to buy cherizi , I decided to use it. python+BI Data analysis .

So I use it on Taobao python Crawling away 3000 Data , And then it's imported into FineBI Visual analysis in the , Finally, the following visual report was produced :

Now let me show you the operation process :

One 、 Data acquisition

use Python Data crawling on Taobao is a commonplace operation , Search directly on Taobao “ Cherry ”, You can see in the product page below , Our main tag this time is “ Name of commodity ”、“ Price ”、“ Number of payers ”、“ Shop name ”、“ Shipping address ” etc. :

Press down F12, Call up the background to view the source code , Find different product tag codes , For example, what is the price “price g_price g_price-highlight">”, The number of payers is “deal-cnt” etc. :

After understanding the code structure of the web page , The next step is to python I wrote the code directly in , The specific process is not described in detail , Part of the code is as follows :

After crawling the data, import it into Excel in , And then in Excel After simple data cleaning and processing , Finally, we get a completed data sheet :

Two 、 Data analysis

python Although it can also realize the function of data analysis , But you need to knock the code , The cost and difficulty of learning are relatively high , It's better to directly use professional data analysis tools for analysis , For example, common ones like FineBI、Tableau、PowerBI etc. .

Now I'm going to take FineBI For example ,FineBI It is a well-known local data analysis tool in China , Compared with tableau The biggest advantage of these foreign tools is that they are simple 、 flexible , All you need to do is drag and drop with the mouse , There's basically no code to write , Very friendly to novices .

Actually FineBI It is essentially an enterprise level business data analysis platform , In addition to data analysis , And data management 、 Data platform building and other functions , I won't go into details here , If you are interested, I'll introduce it next .

With excel Source table , First we will Excel Import to FineBI in :

Then click on the top left corner of the page “ Create a dashboard ”, You can enter the visualization background :

Next, go to the dashboard for visualization , The basic steps are “ Choose chart type —— Choose indicators and dimensions —— Drag to the specified axis —— Beautify the details ”, For example, I want to create a visual map , First, select the chart type as “ Regional map ”, Then you have to choose indicators and dimensions , But there is no geographic latitude in the original data table , So you need to create it yourself :

Last , We drag it to the specified axis , Then beautify the details to make a visual map :

And so on , Other visual charts can also be made according to our own needs , I won't go into details here .

3、 ... and 、 Data visualization

1、 Distribution of sales volume of cherizi

It can be seen that the biggest sales volume of cherizi in China comes from Shanghai , And Zhejiang 、 Guangdong Province , Tibet 、 qinghai 、 There are no sales in Inner Mongolia and other provinces , Basically, sales in coastal areas are higher than those in inland areas .

2、 Sales situation of each province

It's more obvious through the bar chart , The sales volume in Shanghai is 20 More than ten thousand , It's almost Zhejiang 、 guangdong 、 The sum of Sichuan .

3、 The sales of each city

We've screened out the top 10 cities in terms of sales , And the average car price per city , It can be seen that the sales volume and price in Shanghai are the highest , We can see how strong the purchasing power of Shanghai is ;

4、 The price range of chelizi

The data table divides the price range into “50 following ”、“50-100”、“100-150”、“150-200”、“200-500”、“500 above ” etc. , It can be seen that the price range with the largest proportion is “50-100”, It should be civilian price ; It is worth noting that “200-500” The price share of is also higher than “100-150”.

5、 The sales volume and price of each store

It can be seen that the flagship stores have the highest sales volume , The highest average price is basically 600-800 about ;

Four 、 summary

Because there's not a lot of data , So I didn't do too much data analysis this time , You can take the data yourself , stay FineBI There's a lot more analysis going on .

本文为[Soft sails]所创,转载请带上原文链接,感谢

  1. Python notes: List
  2. Translation: practical Python Programming 02_ 03_ Formatting
  3. Python中的四种队列(queue)、堆(heap)
  4. Side effects of Python mutable types as default parameters of functions
  5. This is the best Python tutorial I've ever seen: ten minutes to get to know python
  6. 使用python编写量子线路打印的简单项目,并使用Sphinx自动化生成API文档
  7. Python happy enemy: crawler and anti crawler with a solution to give you New Year
  8. 使用python编写量子线路打印的简单项目,并使用Sphinx自动化生成API文档
  9. When writing python, you will encounter the following error: modulenotfounderror: no module named ' email.mime '; 'email' is not a package
  10. Python class call and private and public property method call
  11. Proprietary methods for Python classes
  12. Foundation of Python: number string and list
  13. Foundation of Python: number string and list
  14. Foundation of Python: number string and list
  15. 华为 Python网络自动化
  16. Python Cannot open E:\Python36\Scripts\
  17. Peeping into the future is not a dream, python data analysis is easy to achieve
  18. The practical skills summed up by Alibaba and Huawei Python engineers, only you haven't seen them yet?
  19. Sour! See the Python programmers on the tiktok get the pay slip...
  20. Foundation of Python: number string and list
  21. Python installation tutorial
  22. Python installation tutorial
  23. This article will familiarize you with the transformation process of Python - > Cafe - > om model
  24. Four kinds of queues and heaps in Python
  25. Using Python to write a simple project of quantum circuit printing, and using Sphinx to automatically generate API documents
  26. Using Python to write a simple project of quantum circuit printing, and using Sphinx to automatically generate API documents
  27. Huawei Python Network Automation
  28. Python Cannot open E:\Python36\Scripts\pip-
  29. 找不到Python问题解决
  30. PHP和Python哪个更有市场前景?我学的是PHP
  31. Python problem resolution not found
  32. Which has more market prospects, PHP or Python? I studied PHP
  33. Foundation of Python: number string and list
  34. python 编码问题之终极解决
  35. The ultimate solution to the problem of Python coding
  36. 能取值亦能赋值的Python切片
  37. Python slice with value and value
  38. 能取值亦能赋值的Python切片
  39. Python slice with value and value
  40. python 异常处理
  41. Python exception handling
  42. python 异常处理
  43. Python exception handling
  44. Orca: 基于DolphinDB的分布式pandas接口
  45. Orca: distributed panda interface based on dolphin DB
  46. 5个无聊Python程序,用Python整蛊你的朋友们吧
  47. Five boring Python programs, trick your friends with Python
  48. python进阶训练营
  49. Python advanced training camp
  50. 【免费】0基础也能轻松学的Python训练营来啦,限时抢位中!
  51. [free] Python training camp, which is easy to learn, is here. It's time to grab a place!
  52. 手把手教你把Python应用到实际开发 不再空谈语法
  53. 全面系统Python3.8入门+进阶 (程序员必备第二语言)
  54. Hand in hand to teach you how to apply Python to practical development
  55. Comprehensive system introduction to Python 3.8 + Advanced
  56. Python语言的排序算法有哪些?Python学习班!
  57. Python language sorting algorithm what? Python classes!
  58. Java、JavaScript、C、C++、PHP、Python都是用来开发什么?
  59. 为什么学习Python?什么途径学习Python合适?
  60. What are Java, JavaScript, C, C + +, PHP and python used to develop?