DPYP

A convenience tool for small-scale data pipelines in Python


â„šī¸ ABOUT

dpyp is a data-pipeline convenience tool containing functionality for reading and writing batches, cleaning data, diagnosing pipelines, manipulating text, and calculating fields in Python.


đŸ“Ĩ INSTALLATION

Dependencies: pandas, pyarrow, numpy


pip install dpyp

🎨 DESIGN

dpyp consists of seven modules each with specialised functionality:

  • - Calculate

  • - Clean

  • - Diagnose

  • - Read

  • - Text

  • - Transform

  • - Write