klotz: hadoop*

Bookmarks on this page are managed by an admin user.

Hadoop is an open source framework for distributed, parallel computation, developed by Doug Cutting and Mike Cafarella. It is based on the functional programming operations Map and Reduce, as described in the Google MapReduce paper.


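  1. import org.apache.spark.{SparkConf, SparkContext}

    // Master and app name are assumed to be supplied externally (e.g. by spark-submit)
    val conf = new SparkConf()

    // Register a custom codec that treats the .gz.tmp extension as ordinary gzip.
    // The "spark.hadoop." prefix forwards the setting to the Hadoop configuration.
    conf.set("spark.hadoop.io.compression.codecs", "smx.ananke.spark.util.codecs.TmpGzipCodec")

    val sc = new SparkContext(conf)

    // Read every object under the given hour prefix as lines of text
    val data = sc.textFile("s3n://my-data-bucket/2015/09/21/13/*")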
    2017-05-24 by klotz
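
The snippet above refers to a `TmpGzipCodec` class but does not show it. The following is a minimal sketch of what such a codec might look like, assuming it only needs to subclass Hadoop's `GzipCodec` and report a different default extension. The body is a guess, not the bookmarked author's actual implementation; only the package and class name come from the configuration string above.

```scala
package smx.ananke.spark.util.codecs

import org.apache.hadoop.io.compress.GzipCodec

// Hypothetical implementation: reuse all gzip behaviour from GzipCodec and
// change only the file extension that Hadoop associates with this codec.
class TmpGzipCodec extends GzipCodec {
  // Hadoop selects a codec by matching a file name's suffix against each
  // registered codec's default extension, so files ending in ".gz.tmp"
  // would be routed to this class and decompressed as gzip.
  override def getDefaultExtension(): String = ".gz.tmp"
}
```

For this to work, the compiled class would presumably have to be on the classpath of both the driver and the executors, for example by packaging it into the application jar.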
