Instructions

Build this docker image
Run bash as the command with an interactive tty to get a shell inside the container:

docker run --rm -it ${whatever-you-named-the-image} /bin/bash

The data is in the directory /root/data on said image
Create a Pull Request with your code for review

You're free to use whatever language you want, as long as you include instructions on how to run your code. (Bonus points if you modify the Dockerfile instead.)

Note that you do not have to use a Big Data stack like Hadoop or Spark. If you do use one, provide docker-swarm or kubernetes configuration file(s) in your Pull Request that will set up the cluster; otherwise we won't be able to run the code.

Questions

what's the average number of fields across all the `.csv` files?

output should be a simple number

sample output
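
One possible approach to this question, as a minimal sketch rather than a reference answer. It assumes that "fields" means the columns in each file's first row, that the average is taken per file, and that the files are UTF-8 text; the helper names `field_count` and `average_field_count` are hypothetical.

```python
import csv
from pathlib import Path


def field_count(csv_path):
    """Return the number of fields in the first row of one file.

    Assumption: the first row is a header that lists every field.
    """
    with open(csv_path, newline="", encoding="utf-8", errors="replace") as handle:
        header = next(csv.reader(handle), [])
    return len(header)


def average_field_count(data_dir):
    """Average the per-file field counts over every .csv file under data_dir."""
    counts = [field_count(path) for path in sorted(Path(data_dir).rglob("*.csv"))]
    # An empty directory yields 0.0 instead of a division-by-zero error.
    return sum(counts) / len(counts) if counts else 0.0
```

Inside the container this could be called as `print(average_field_count("/root/data"))`. If the intended average is per row rather than per file, the counting step would need to change accordingly.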

create a csv file that shows the word count of every value of every dataset (dataset being a `.csv` file)

output should be a csv file that has a header row with the fields `value` and `count`, and one entry for every value found:

sample output

value,count
some value,435
another value,234
word,45
...
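
A minimal sketch of how such a file could be produced, not a definitive solution. It assumes a "value" is the full content of one cell, that the first row of each file is a header and is not counted, and that the files are UTF-8 text; `count_values` and `write_counts` are hypothetical names.

```python
import csv
from collections import Counter
from pathlib import Path


def count_values(data_dir):
    """Count how often each cell value occurs across all .csv files."""
    counts = Counter()
    for path in sorted(Path(data_dir).rglob("*.csv")):
        with open(path, newline="", encoding="utf-8", errors="replace") as handle:
            reader = csv.reader(handle)
            next(reader, None)  # assumption: first row is a header, not data
            for row in reader:
                counts.update(row)  # each cell in the row is one value
    return counts


def write_counts(counts, out_path):
    """Write a value,count csv, most frequent values first."""
    with open(out_path, "w", newline="", encoding="utf-8") as handle:
        writer = csv.writer(handle)
        writer.writerow(["value", "count"])
        writer.writerows(counts.most_common())
```

The output file is best written outside the data directory, since a `.csv` result placed under `/root/data` would be picked up as another dataset on the next run.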

what's the total number of rows for all the `.csv` files?

output should be a simple number

sample output

1000000000
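
The row total could be computed along these lines; this is a sketch under stated assumptions. Rows are counted as parsed CSV records, so a quoted field containing a line break still counts as one row, and the header row is excluded by default. Whether headers should count is not specified, so the hypothetical `skip_header` flag makes the choice explicit.

```python
import csv
from pathlib import Path


def count_rows(csv_path, skip_header=True):
    """Count the CSV records in one file, optionally excluding the header."""
    with open(csv_path, newline="", encoding="utf-8", errors="replace") as handle:
        rows = sum(1 for _ in csv.reader(handle))
    # Guard against empty files when the header is subtracted.
    return max(rows - 1, 0) if skip_header else rows


def total_rows(data_dir, skip_header=True):
    """Sum the row counts of every .csv file under data_dir."""
    return sum(
        count_rows(path, skip_header)
        for path in sorted(Path(data_dir).rglob("*.csv"))
    )
```

Streaming through `csv.reader` keeps memory use flat, which matters if the total is anywhere near the scale of the sample output.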

charlesharris/screen
