The Real Python Podcast – Episode #217: Packaging Data Analyses & Using pandas GroupBy

Aug 16, 2024 55m

What are the best practices for organizing data analysis projects in Python? What are the advantages of a more package-centric approach to data science? Christopher Trudeau is back on the show this week, bringing another batch of PyCoder’s Weekly articles and projects.

We discuss Joshua Cook’s recent article “How I Use Python to Organize My Data Analyses.” The article covers how his process for building data analysis projects has evolved and now incorporates modern Python packaging techniques.

Christopher shares his recent video course on grouping real-world data with pandas. The course offers a quick refresher before digging into how to use pandas GroupBy to manipulate, transform, and summarize data.

We also share several other articles and projects from the Python community, including a news roundup, working with JSON data in Python, running an asyncio event loop in a separate thread, knowing the why behind a system’s code, a retro game engine for Python, and a project for vendorizing packages from PyPI.

This episode is sponsored by Mailtrap.

Topics:

  • 00:00:00 – Introduction
  • 00:02:18 – Setuptools Breaks Things, Then Fixes Them
  • 00:04:57 – PEP 751: A File Format to List Python Dependencies
  • 00:07:04 – Python 3.13.0 Release Candidate 1 Released
  • 00:07:15 – Python Insider: Python 3.12.5 released
  • 00:07:22 – Django 5.1 released - Django Weblog
  • 00:07:27 – Django security releases issued: 5.0.8 and 4.2.15
  • 00:07:49 – How I Use Python to Organize My Data Analyses
  • 00:13:45 – Sponsor: Mailtrap
  • 00:14:21 – pandas GroupBy: Grouping Real World Data in Python
  • 00:20:33 – Working With JSON Data in Python
  • 00:25:01 – Asyncio Event Loop in Separate Thread
  • 00:30:33 – Video Course Spotlight
  • 00:31:47 – Habits of great software engineers
  • 00:49:17 – pyxel: A Retro Game Engine for Python
  • 00:52:36 – python-vendorize: Vendorize Packages From PyPI
  • 00:54:18 – Thanks and goodbye

Show Links:

  • How I Use Python to Organize My Data Analyses – This is a description of how Joshua uses Python in a package-centric way to organize his approach to data analyses. This is a system he has evolved while working on his computational biology Ph.D. and working in industry.
  • pandas GroupBy: Grouping Real World Data in Python – In this course, you’ll learn how to work adeptly with pandas GroupBy while mastering ways to manipulate, transform, and summarize data. You’ll work with real-world datasets and chain GroupBy methods together to get data into an output that suits your needs.
  • Working With JSON Data in Python – In this tutorial, you’ll learn how to read and write JSON-encoded data in Python. You’ll begin with practical examples that show how to use Python’s built-in “json” module and then move on to learn how to serialize and deserialize custom data.
  • Asyncio Event Loop in Separate Thread – The asyncio event loop typically runs in the main thread. Because the interpreter also uses that thread, you sometimes want the event loop to run in a separate thread. This article explains why and how to do that.
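
To make the last item above concrete, here is a minimal sketch of one way to run an asyncio event loop in a separate thread using only the standard library. It is not taken from the linked article. The helper names `start_background_loop`, `stop_background_loop`, and `add_later` are hypothetical and exist only for illustration.

```python
import asyncio
import threading


def start_background_loop():
    """Create a new event loop and run it forever in a daemon thread.

    Hypothetical helper for illustration; not from the linked article.
    """
    loop = asyncio.new_event_loop()

    def run():
        # Bind the loop to this worker thread, then block here until stopped.
        asyncio.set_event_loop(loop)
        loop.run_forever()

    thread = threading.Thread(target=run, name="background-loop", daemon=True)
    thread.start()
    return loop, thread


def stop_background_loop(loop, thread):
    """Ask the loop to stop from another thread, then clean up."""
    loop.call_soon_threadsafe(loop.stop)
    thread.join()
    loop.close()


async def add_later(a, b):
    """Stand-in coroutine that simulates a short asynchronous wait."""
    await asyncio.sleep(0.01)
    return a + b


loop, thread = start_background_loop()

# Submit a coroutine from the main thread. The call returns a
# concurrent.futures.Future that the main thread can block on.
future = asyncio.run_coroutine_threadsafe(add_later(2, 3), loop)
result = future.result(timeout=5)
print(result)

stop_background_loop(loop, thread)
```

The main thread stays free for blocking work, and `asyncio.run_coroutine_threadsafe` is the thread-safe way to hand coroutines to the loop running elsewhere.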

Source: realpython.com
