It’s all the time good to course of the info utilizing trendy instruments like Pandas or Jupyter. However let’s think about the case when a colleague or buddy asks to make an information evaluation, however she or he just isn’t a technical individual, doesn’t use Python or Jupyter, and doesn’t have any account in Tableau, Energy BI, or another fancy (however, alas, not free) service. On this case, processing the info in Google Sheets is usually a good workaround due to a number of causes:
- Google is used worldwide; on the time of writing this text, greater than 1.8 billion customers have a Google Account. Virtually virtually everybody has a Google account these days, and doc sharing shall be extraordinarily simple.
- Google’s ecosystem is protected and safe. It helps two-factor authentication and trendy safety requirements, and even non-public datasets may be shared between restricted teams of individuals.
- Final however not least, the answer is free and doesn’t require any further prices. And as a bonus, Google Sheets works within the browser, doesn’t require putting in any software program, and might work on any platform like Home windows, Linux, OSX, and even on a smartphone.
On this article, I’ll make a fundamental exploratory information evaluation in Pandas, then we’ll repeat this course of in Google Sheets and see the way it works.
To make issues extra enjoyable, let’s use an actual dataset. We are going to make a instrument to calculate the vitality generated by photo voltaic panels. To do that, I’ll use the PVGIS (European Fee Picture Voltaic Geographical Info System) information, which may be accessed without cost through this URL (CC BY 4.0 Licence):
Utilizing this web page, we are able to obtain photo voltaic irradiation information, permitting us to calculate vitality era. As may be seen within the screenshot, we are able to choose hourly information for various years and totally different places. After downloading the info, let’s use it in Pandas.
EDA in Pandas
Let’s begin with exploratory information evaluation (EDA) in Pandas. It’s all the time simpler to…