我正在关注本教程:https ://docs.microsoft.com/en-us/learn/modules/intro-to-azure-databricks/4-using-notebooks
在本教程中,我们创建一个这样的数据库:
%sql
CREATE DATABASE IF NOT EXISTS Databricks;
USE Databricks;
CREATE TABLE IF NOT EXISTS AirlineFlight
USING CSV
OPTIONS (
header="true",
delimiter=",",
inferSchema="true",
path="dbfs:/mnt/training/asa/flights/small.csv"
);
CACHE TABLE AirlineFlight;
SELECT * FROM AirlineFlight;
这个数据库是在哪里创建的?此外还有一个问题
Question: Which of the following are good applications for Apache Spark? (Select all that apply.)
Querying, exploring, and analyzing very large files and data sets
Joining data lakes
Machine learning and predictive analytics
Processing streaming data
Graph analytics
Overnight batch processing of very large files
Updating individual records in a database
Answer: All but #7. Apache Spark uses SQL to read and performs analysis on large files, but it is not a Database.
如果我们可以使用 spark 创建数据库,那么为什么我们不能也更改它的记录