orc - I am using spark 1.4 and trying to save as orcfile with compression snappy but it saves as zlib -
here code: val df=hivecontext.write.format("orc").options("orc.compression","snappy").save( "xyz")
but file saved zlib.
please help.
you try adding conf "spark.io.compression.codec=snappy" spark-shell / spark-submit:
spark-shell --conf spark.io.compression.codec=snappy #rest of command..
also, writing orc format (assuming in spark >= 1.5) can use:
mydf.orc("/some/path")
the "orc" method doing '.format("orc").save("/some/path")'.
Comments
Post a Comment