SQL interface for structured data processing
Spark SQL allows you to run SQL queries on your DataFrames after registering them as temporary views.
# Register DataFrame as a temporary view
df.createOrReplaceTempView("people")
# Run SQL query
results = spark.sql("""
SELECT name, age
FROM people
WHERE age > 30
ORDER BY age DESC
""")
results.show()
df.select("name", "age")
.filter(df.age > 30)
.orderBy(df.age.desc())
SELECT name, age
FROM people
WHERE age > 30
ORDER BY age DESC