Tags: hadoop*

Hadoop is an open source framework for distributed, parallel computation, developed by Doug Cutting and Mike Cafarella. It is based on the functional programming operations Map and Reduce, as described in the Google MapReduce paper.


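  1. import org.apache.spark.{SparkConf, SparkContext}

    val conf = new SparkConf()

    // Register a custom codec that treats files with a .gz.tmp extension as ordinary Gzip.
    // Spark copies "spark.hadoop.*" properties into the Hadoop configuration.
    conf.set("spark.hadoop.io.compression.codecs", "smx.ananke.spark.util.codecs.TmpGzipCodec")

    val sc = new SparkContext(conf)

    // Read every object under the hour prefix; the codec is chosen by file extension
    val data = sc.textFile("s3n://my-data-bucket/2015/09/21/13/*")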
    2017-05-24 by klotz
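
The bookmark does not show the source of TmpGzipCodec. The following is only a minimal sketch of how such a codec could be written. It assumes the class does nothing more than extend Hadoop's GzipCodec and report a different default extension. Hadoop selects a codec by matching the file name suffix against getDefaultExtension, so this override is enough for .gz.tmp files to be decompressed as Gzip. The class body here is an assumption, not the original author's code.

```scala
// Hypothetical reconstruction; place it in package smx.ananke.spark.util.codecs
// so that it matches the class name used in the configuration above.
import org.apache.hadoop.io.compress.GzipCodec

class TmpGzipCodec extends GzipCodec {
  // GzipCodec normally reports ".gz"; reporting ".gz.tmp" instead makes
  // Hadoop's suffix-based codec lookup pick this codec for such files.
  override def getDefaultExtension(): String = ".gz.tmp"
}
```

The compiled class has to be on the classpath of both the driver and the executors, for example by including it in the application jar.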
