View on GitHub

skelebot

Machine Learning Project Development Tool

Machine learning projects are too often just a loose collection of unorganized scripts. Skelebot aims to bring order to that chaos.


Purpose

Skelebot is a command-line tool for developing machine learning projects and executing them in Docker. The purpose of Skelebot is to simply make the life of a Data Scientist easier by doing a lot of the legwork for mundane tasks automatically through a unified, consistent interface.

By allowing jobs to be executed in Docker, it removes the need for the developer to install specific R and Python packages on their own machine, and even removes the need to have R installed at all. Configured jobs in Skelebot also come with the added benefit of built-in help documentation in order to assist others in understanding what jobs your project has, and what those jobs do.

Skelebot also saves developer time by integrating with HDFS through Kerberos automatically. By building on top of a library of pre-built Docker images tailored specifically for Skelebot’s purposes, the process of building the Docker image for a project is greatly reduced. Skelebot also encourages a specific folder structure through it’s scaffolding process thereby introducing consistency across projects and developers. By providing a uniform interface on which to discover the project’s jobs it greatly helps to reduce the barrier to entry for newcomers to the project.


Features


Getting Started