-
Notifications
You must be signed in to change notification settings - Fork 17
Description
The DsKit project aims to simplify the data science and machine learning workflow by providing a unified, production-ready toolkit that wraps commonly used operations—such as data cleaning, exploratory data analysis, feature engineering, and modeling—into simple, easy-to-use functions. This approach helps data scientists and ML engineers reduce the significant amount of time spent on repetitive boilerplate tasks and focus more on problem-solving and experimentation.
My proposed contribution focuses on strengthening the foundation of DSKIT by improving the quality, reliability, and usability of its existing utilities. This includes enhancing function docstrings, adding meaningful unit tests, creating example notebooks, and introducing small but impactful data cleaning utilities. These improvements are important because they directly affect how easily users can understand, trust, and adopt the library in real-world projects.