klotz: spark* + python*

  1. In this post we will first discuss how to set up Spark so you can easily start performing analytics, either on your local machine or on a cluster on EC2. We will then explore Spark at an introductory level, moving towards an understanding of what Spark is and how it works (hopefully motivating further exploration). In the last two sections we will interact with Spark on the command line and then demonstrate how to write a Spark application in Python and submit it to the cluster as a Spark job.
    2015-02-05 by klotz
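  2. import findspark
    findspark.init()  # Locate the Spark installation and add pyspark to sys.path
    import pyspark  # Import this only after findspark.init()
    from pyspark.context import SparkContext
    from pyspark.sql.session import SparkSession

    sc = SparkContext.getOrCreate()  # Reuse the active context or create a new one
    spark = SparkSession(sc)  # Wrap the context in a SparkSession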
    2020-05-26 by klotz
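
The first bookmark mentions writing a Spark application in Python and submitting it to a cluster as a job. A minimal sketch of what such an application might look like is given here; it is not the code from the linked post. The file name `wordcount_sketch.py`, the application name, and the `tokenize` helper are hypothetical, and `pyspark` is imported inside `main` only so that the helper can be exercised without a Spark installation.

```python
import sys
from operator import add


def tokenize(line):
    """Split a line into lowercase, whitespace-separated words."""
    return line.lower().split()


def main(path):
    # Imported here (an assumption of this sketch) so the module
    # can be loaded even where pyspark is not installed.
    from pyspark.sql import SparkSession

    spark = SparkSession.builder.appName("WordCountSketch").getOrCreate()
    try:
        counts = (
            spark.sparkContext.textFile(path)  # one record per input line
            .flatMap(tokenize)                 # one record per word
            .map(lambda word: (word, 1))       # pair each word with a count of 1
            .reduceByKey(add)                  # sum the counts per word
        )
        # Print the ten most frequent words.
        for word, n in counts.takeOrdered(10, key=lambda kv: -kv[1]):
            print(word, n)
    finally:
        spark.stop()


if __name__ == "__main__":
    if len(sys.argv) == 2:
        main(sys.argv[1])
    else:
        print("usage: spark-submit wordcount_sketch.py <input-path>", file=sys.stderr)
```

Assuming Spark is installed and `spark-submit` is on the PATH, a script like this could be launched with `spark-submit wordcount_sketch.py <input-path>`, where the input path is a text file of your choosing.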