Comparing changes

base repository: rockydev/spark-csv (base: master)
head repository: databricks/spark-csv (compare: master)

  • 8 commits
  • 16 files changed
  • 7 contributors

Commits on Jul 12, 2016

  1. Fix for #308 write using specified dateFormat

    Added support for writing to a file with a dateFormat specified in the options, along with some basic tests for it.
    
    I'm continuing to use SimpleDateFormat since that is what is used for reading. I would prefer Joda's DateTimeFormat, but that can be a separate issue.
    
    It's too bad CSVFormat does not allow specification of a dateFormat as it does for the null value. That would make this much easier.
    
    Author: barrybecker <barrybecker@sgi.com>
    
    Closes databricks#310 from barrybecker4/master.
    barrybecker authored and falaki committed Jul 12, 2016 · commit 6290b6c
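For readers following along, writing with the new option looks roughly like this — a sketch against the spark-csv 1.x DataFrame writer API, where `df` and the output path are placeholders:

```scala
// Write a DataFrame containing date columns, formatting them with the
// pattern given in the new dateFormat option. The pattern is interpreted
// by java.text.SimpleDateFormat, matching what the reader already uses.
df.write
  .format("com.databricks.spark.csv")
  .option("header", "true")
  .option("dateFormat", "yyyy-MM-dd") // SimpleDateFormat pattern
  .save("/tmp/dates-out")             // hypothetical output path
```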

Commits on Jul 26, 2016

  1. fix NPE that can occur for null dates

    I know databricks#310 was recently merged and closed, but I just made an additional change to fix a problem I noted when null dates are in the data. A check for null needs to be done. Let me know if I need to create a separate issue or if you can just merge this.
    
    Author: barrybecker <barrybecker@sgi.com>
    Author: Barry Becker <BBE@esi-group.com>
    
    Closes databricks#360 from barrybecker4/master.
    barrybecker authored and falaki committed Jul 26, 2016 · commit 1fd65ad
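A minimal sketch of the kind of guard the fix adds — the helper name is illustrative, not the actual spark-csv code:

```scala
import java.text.SimpleDateFormat
import java.sql.Date

// Format a date cell only when it is non-null; otherwise emit the
// configured null token instead of calling SimpleDateFormat.format
// on null, which is what produced the NullPointerException.
def formatDateCell(value: Date,
                   fmt: SimpleDateFormat,
                   nullToken: String): String =
  if (value == null) nullToken else fmt.format(value)
```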

Commits on Sep 5, 2016

  1. Spelling Correction

     Changed "whitesspaces" to "white space".
    
    Author: Mayank-Shete <mayank.shete@gmail.com>
    
    Closes databricks#331 from Mayank-Shete/master.
    Mayank-Shete authored and falaki committed Sep 5, 2016 · commit 1aebd64
  2. Allow for maxCharsPerCol to be set

    Currently, the maxCharactersPerColumn value is hardcoded to
    100,000. I (sadly) have CSV fields with more than 100k characters so
    need to be able to configure this.
    
    This allows the value to be passed the same way as other parameters while
    keeping a backward-compatible default of 100k.
    
    Author: Addison Higham <ahigham@instructure.com>
    
    Closes databricks#307 from addisonj/master.
    Addison Higham authored and falaki committed Sep 5, 2016 · commit e3f3c1b
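Raising the limit would then look roughly like this — a sketch using the option name from the commit title, with a placeholder input path:

```scala
// Read a file whose fields can exceed the previously hard-coded
// 100,000-character limit; the default remains 100000 when the
// option is not set, preserving backward compatibility.
val df = sqlContext.read
  .format("com.databricks.spark.csv")
  .option("header", "true")
  .option("maxCharsPerCol", "200000") // raise the per-column limit
  .load("/tmp/wide-fields.csv")       // hypothetical input path
```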
  3. Preparing for 1.5 release

    Changes for 1.5 and final release
    
    Author: Hossein <hossein@databricks.com>
    
    Closes databricks#379 from falaki/changes-for-v1.5.0.
    falaki committed Sep 5, 2016 · commit 1ae6492

Commits on Oct 1, 2016

  1. Treat nullValue for string as well

    https://github.com/databricks/spark-csv/issues/370
    
    This PR fixes `nullValue` handling for `StringType`. It seems it'd be better to treat all the types consistently.
    
    Author: hyukjinkwon <gurwls223@gmail.com>
    
    Closes databricks#384 from HyukjinKwon/null-string.
    HyukjinKwon authored and falaki committed Oct 1, 2016 · commit a0b5a27
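In usage terms, the change means the `nullValue` option now applies to string columns too — a sketch with a placeholder path and token:

```scala
// With this fix, the nullValue token is honored for StringType columns
// as well as the other types, so "NA" cells become null in string
// columns instead of surviving as the literal string "NA".
val df = sqlContext.read
  .format("com.databricks.spark.csv")
  .option("header", "true")
  .option("nullValue", "NA")  // token to treat as null across all types
  .load("/tmp/people.csv")    // hypothetical input path
```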

Commits on Jan 7, 2017

  1. delete ";" to match style to others

    I think it is better to delete ";" to match the style of the other Scala sources.
    
    Author: chie hayashida <chie8842@gmail.com>
    
    Closes databricks#395 from hayashidac/master.
    chie8842 authored and falaki committed Jan 7, 2017 · commit 4ee6063

Commits on Jan 9, 2017

  1. Fixed inferring of types under existence of malformed lines

    The type inference mechanism takes malformed lines into account, even though the user may have chosen to drop them. In my opinion, this leads to faulty results, since the inferred types get distorted by unruly lines that may have more or fewer columns. Since those lines get dropped anyway during the creation of the dataframe, they shouldn't be taken into account during type inference either.
    
    Author: Georgios Gkekas <gkekas@gmail.com>
    Author: Georgios Gkekas <Georgios Gkekas>
    Author: digigek <gkekas@gmail.com>
    
    Closes databricks#394 from digigek/inference-malformed.
    digigek authored and falaki committed Jan 9, 2017 · commit 7997c3c
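The scenario the commit describes corresponds to reading with malformed-line dropping enabled — a sketch with a placeholder path:

```scala
// With mode DROPMALFORMED, lines whose column count does not match the
// rest of the file are discarded when building the DataFrame; after this
// fix they are also skipped during schema inference, so they no longer
// distort the inferred column types.
val df = sqlContext.read
  .format("com.databricks.spark.csv")
  .option("header", "true")
  .option("inferSchema", "true")
  .option("mode", "DROPMALFORMED")
  .load("/tmp/messy.csv")  // hypothetical input path
```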