Sunday, October 9, 2022

 Research: Apache Spark Cluster in Azure HDInsight:

In this demo, we will work on creating Spark cluster

a) Go to Azure Portal and search for HDInsight


Fill in cluster name, cluster admin password and region. For Storage with default values.

Configuration with D12v2 nodes selection for lesser pricing and finish review+ Create.

b) Create Jupyter notebook by hitting https://<clustername>.azurehdinsight.net/jupyter


On the sign-in page: enter cluster admin and password. Select New PySpark and run queries.


Select File Menu to Close and Halt notebook and release cluster resources







No comments: