Blog
Your blog category
Column Level Encryption using PySpark
If you ever get a requirement to encrypt some of the column’s (Sensitive/Personal Identifiable Information) data before storing it anywhere, then you are at the right place.Below step by step code block can help you achieve this. In this demonstration, Fernet library will be used to generate key which will further be used to encrypt […]
Connect on-premises networks to Azure
VPN Gateway A virtual private network (VPN) is a type of private interconnected network. VPNs use an encrypted tunnel within another network. They’re typically deployed to connect two or more trusted private networks to one another over an untrusted network (typically the public Internet). Traffic is encrypted while traveling over the untrusted network to prevent […]
Azure Databricks Basics with Spark
Overview This blog post will provide an overview of Databricks, Azure Databricks, Apache spark fundamental. In this post you will learn about Databricks concepts (Workspace, Notebook, Cluster, Jobs, Scheduling etc.) and Spark fundamental will cover architecture and key features. Agenda Apache Spark Fundamentals Azure Databricks Pre-Requisites Understanding of Azure basic terminology Understanding of Big data […]
Azure Data Lake Catalog (U-SQL)
The Azure Data Lake Catalog (U-SQL) is one of the ADLA component by which U-SQL organizes data and code for sharing and re-use. The Catalog stores databases, tables, views, stored procedures, table-valued functions (TVFs), schemas, assemblies, external data sources and all other code-related items. Basically catalog are useful when there is a requirement of code […]