Home
Categories
EXPLORE
True Crime
Comedy
Society & Culture
Business
Sports
TV & Film
Technology
About Us
Contact Us
Copyright
© 2024 PodJoint
00:00 / 00:00
Sign in

or

Don't have an account?
Sign up
Forgot password
https://is1-ssl.mzstatic.com/image/thumb/Podcasts115/v4/90/e9/76/90e97622-62b2-9b2e-d678-4284b64841d6/mza_16925574863160776620.jpg/600x600bb.jpg
The Data Life Podcast
Sanket Gupta
27 episodes
1 week ago
This is a podcast where we talk all-about real life experiences of dealing with data and machine learning tools, techniques and personalities. We cover not just the technical aspects but also the "life" aspects of working in the field. Note: Opinions expressed are my own and do not express the views or opinions of my employer.
Show more...
Technology
RSS
All content for The Data Life Podcast is the property of Sanket Gupta and is served directly from their servers with no modification, redirects, or rehosting. The podcast is not affiliated with or endorsed by Podjoint in any way.
This is a podcast where we talk all-about real life experiences of dealing with data and machine learning tools, techniques and personalities. We cover not just the technical aspects but also the "life" aspects of working in the field. Note: Opinions expressed are my own and do not express the views or opinions of my employer.
Show more...
Technology
https://d3t3ozftmdmh3i.cloudfront.net/production/podcast_uploaded_nologo/1452849/1452849-1559791844755-0ffa52463af9d.jpg
17: Why Pandas is the new Excel
The Data Life Podcast
16 minutes 37 seconds
6 years ago
17: Why Pandas is the new Excel

The Data Life Podcast is a podcast where we talk all-about real life experiences with data and data science science tools, techniques, models and personalities. 

In this episode, we will talk about how Pandas is becoming a tool of choice for many data scientists for doing their data analysis work. We will explore how Pandas wins over Excel in several key areas that are important for businesses today:

1) Large dataset sizes
2) Different kinds of input formats such as JSON, CSV, HTML, SQL etc
3) Complex business logic
4) Linking data analysis work to websites and databases
5) Cost

Pandas has lots of helpful functions such as read_csv, read_json, read_sql that allow easy input of data into dataframes. DataFrames have several useful methods like "describe", "value_counts", "groupby", "loc" and more that allow easy understanding of your dataset. It also supports plotting out of the box with "plot" method.

We also cover how Pandas differs from SQL in things like ease of handling time series data, visualizations and more.
Tune in to the episode to learn more about how Pandas might be the tool for your data analysis needs to take your business to next level! 

Fantastic Resources:
1) Book by Pandas creator Wes McKinney: https://www.amazon.com/dp/1491957662/?tag=omnilence-20
2) Great workshop video by Kevin Markham in PyCon: https://www.youtube.com/watch?v=0hsKLYfyQZc
3) Input output methods for Pandas:  https://pandas.pydata.org/pandas-docs/stable/user_guide/io.html
4) Comparison of some operations of Pandas with SQL https://pandas.pydata.org/pandas-docs/stable/getting_started/comparison/comparison_with_sql.html

Thanks for listening! Please consider supporting this podcast from the link in the end. 

The Data Life Podcast
This is a podcast where we talk all-about real life experiences of dealing with data and machine learning tools, techniques and personalities. We cover not just the technical aspects but also the "life" aspects of working in the field. Note: Opinions expressed are my own and do not express the views or opinions of my employer.