Understanding Big Data File Systems - HDFS and DBFS

15.07.2020
Before getting into data ingestion using NiFi and data processing using Spark, let us get an overview of the file systems used in Big Data clusters, both on-prem and in the cloud.

Join this channel to get access to perks: https://www.youtube.com/channel/UCakdSIPsJqiOLqylgoYmwQg/join

Here is the link for the material covered as part of the session - https://github.com/dgadiraju/itversity-books/blob/master/Data%20Engineering%20Bootcamp/40%20Big%20Data%20ecosystem%20-%20Overview/02%20Understanding%20Big%20Data%20File%20Systems%20-%20HDFS%20and%20DBFS.ipynb

* Understanding Storage Servers
* List of File Systems
* Understanding Hadoop Storage (HDFS)
* HDFS Architecture
* HDFS Commands – Overview
* Customizing Properties
* Overview of DBFS Commands

Similar to HDFS and DBFS, we use AWS-specific or Azure-specific commands to manage files on S3 or Azure Blob respectively. Both also provide web interfaces for the same purpose.

Here is the complete playlist about Spark for Certifications: https://www.youtube.com/playlist?list=PLf0swTFhTI8rMmW7GZv1-z4iu_-TAv3bi

Here is the complete playlist about Free Data Engineering Bootcamp: https://www.youtube.com/playlist?list=PLf0swTFhTI8pBe2Vr2neQV7shh9Rus8rl

* Join our Meetup group - https://www.meetup.com/itversityin/
* Enroll for our labs - https://labs.itversity.com/plans
* Subscribe to our YouTube Channel for Videos - http://youtube.com/itversityin/?sub_confirmation=1
* Access Content via our GitHub - https://github.com/dgadiraju/itversity-books
* Lab and Content Support using Slack
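
As a rough illustration of the point that HDFS, DBFS, and S3 are each managed with their own commands, here is a minimal Python sketch that picks a file-listing command from a path's URI scheme. It is not taken from the session material. The helper name `list_command` and the example paths are hypothetical, and the sketch assumes the `hdfs`, `databricks`, and `aws` command-line tools are installed and configured. It only builds the command and does not run it. Azure Blob is left out because its CLI takes the account and container as separate options rather than as a single path.

```python
from typing import List
from urllib.parse import urlparse


def list_command(uri: str) -> List[str]:
    """Build (but do not run) the CLI command that lists files under `uri`.

    Hypothetical helper for illustration only.
    """
    scheme = urlparse(uri).scheme
    if scheme == "hdfs":
        # Hadoop file system shell
        return ["hdfs", "dfs", "-ls", uri]
    if scheme == "dbfs":
        # Databricks CLI, file system commands
        return ["databricks", "fs", "ls", uri]
    if scheme == "s3":
        # AWS CLI, high-level S3 commands
        return ["aws", "s3", "ls", uri]
    raise ValueError("no listing command known for scheme: %r" % scheme)


if __name__ == "__main__":
    # Example paths are made up; replace them with real locations.
    for path in ["hdfs:///user/example", "dbfs:/FileStore", "s3://example-bucket/data/"]:
        print(" ".join(list_command(path)))
```

To actually run one of these commands, the returned list could be passed to `subprocess.run`, provided the matching CLI is available on the machine.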
