Retrospect is a system for scalable data analysis that combines the simplicity of Python with built‑in map‑reduce, a fast abstract machine, and a framework for distributed computation, aiming to provide easy‑to‑use, expressive, and scalable data analysis.
output = sqlContext.sql("SELECT * From people")
output.registerTempTable('people2')
sqlContext.cacheTable('people2')
sqlContext.sql("SELECT count(*) From people2").collect()