登录查看更多内容

NameNode Server in HDFS

Babak Rezaei Bastani

Senior Web Developer

发布日期: 2019年7月11日

The main node in HDFS is that it maintains and manages the blocks on the DataNodes. NameNode is a very high-availability server that manages the file namespace and controls user access to files. The HDFS architecture is such that user data is not mounted on NameNode. These data are only mounted on NameNodes.

NameNode's mission are as follows:

A major process that maintains and manages DataNodes.
It records the metadata of all files stored in the cluster (for example, the location of stored blocks, file sizes, permissions, hierarchies, etc.). Two files associated with metadata are:

FsImage: includes the full status of the file system namespace since the start of the NameNode.
EditLogs: Includes all recent changes to the latest FsImage.

Any changes that occur on the file system metadata are logged. For example, if a file is deleted in HDFS, NameNode immediately logs it in EditLog.
It regularly receives heart beats and block's report from all DataNodes to ensure their viability.
Holds a record of all blocks in the HDFS, and there are nodes in which the blocks are located.
It is also responsible for taking care of all factors of the replication of all blocks.
In the event of a DataNode failure, it selects a new DataNode for duplication, use of disk balancing and traffic control for DataNode.

要查看或添加评论，请登录

Babak Rezaei Bastani的更多文章

HDFS Architecture (Basic concepts)

2019年7月11日

HDFS Architecture (Basic concepts)

HDFS is a blocked file system in which each file is split into blocks of predefined size. These blocks are stored in…
What is MapReduce?

2019年6月30日

What is MapReduce?

MapReduce is a processing method and a Java-based distribution model for distributed computing. The MapReduce algorithm…
HDFS goals

2019年6月28日

HDFS goals

Fault detection and recovery : Because HDFS contains a large number of commodity hardware, the probability of failure…
An overview of HDFS

2019年6月28日

An overview of HDFS

The Hadoop file system was developed using distributed file system design and runs on commodity hardware. Unlike other…
Introduction to Hadoop

2019年6月27日

Introduction to Hadoop

Hadoop is an apache-based open source framework written in Java programming language, which allows simple…
Data Science Processing Tools

2019年6月11日

Data Science Processing Tools

Once learned with data storage, you need to be familiar with data processing tools for converting data lakes to data…
Data Warehouse Bus Matrix

2019年6月8日

Data Warehouse Bus Matrix

The Enterprise Bus Matrix is a data warehouse planning tool developed by Ralph Kimball and is being used by numerous…
Data vault

2019年6月8日

Data vault

Data vault modeling, designed by Dan Linstedt, is a database modeling method that has been deliberately structured in…
Data Lake

2019年6月7日

Data Lake

A Data lake is a data storage tank for a large amount of raw data. Waiting for future needs, the data lake saves the…
Data Science Storage Tools

2019年6月6日

Data Science Storage Tools

The data science ecosystem has a set of tools that we use to build our solutions. The capabilities of this environment…

See all articles

NameNode Server in HDFS

Babak Rezaei Bastani

Senior Web Developer

Babak Rezaei Bastani的更多文章

社区洞察

其他会员也浏览了

OtterTune Raises 12 Million USD for Database Maintenance Automation

Rebalancing the Partitions and How not to do it

How to install and configure IBM Infosphere Data Replication

Pgackrest and Minio, the perfect match

How Did I resolve sudden CPU Spike of SQL Server

SQL Server Notes by AB | Note #31 | Database-wise CPU Cost | #ABSQLNotes

SQL Server Notes by AB | Note #25 | Parallelism & CXPACKET | #ABSQLNotes

SQL Server Notes by AB | Note #21 | Processor % Processor Time vs Process % Processor Time | #ABSQLNotes

PIT mounted Filesystem Design

SQL Server Notes by AB | Note #15 | Identifying Workloads That Are Causing High IO | #ABSQLNotes

Babak Rezaei Bastani的更多文章

HDFS Architecture (Basic concepts)

What is MapReduce?

HDFS goals

An overview of HDFS

Introduction to Hadoop

Data Science Processing Tools

Data Warehouse Bus Matrix

Data vault

Data Lake

Data Science Storage Tools

社区洞察

其他会员也浏览了

OtterTune Raises 12 Million USD for Database Maintenance Automation

Rebalancing the Partitions and How not to do it

How to install and configure IBM Infosphere Data Replication

Pgackrest and Minio, the perfect match

How Did I resolve sudden CPU Spike of SQL Server

SQL Server Notes by AB | Note #31 | Database-wise CPU Cost | #ABSQLNotes

SQL Server Notes by AB | Note #25 | Parallelism & CXPACKET | #ABSQLNotes

SQL Server Notes by AB | Note #21 | Processor % Processor Time vs Process % Processor Time | #ABSQLNotes

PIT mounted Filesystem Design

SQL Server Notes by AB | Note #15 | Identifying Workloads That Are Causing High IO | #ABSQLNotes