Single Node to Distributed System
Are the concepts of Distributed System Unique or Are they borrowed from Single Node System
The important resources at disposal on a single node are
- CPU
- RAM
- Network
- File-system
Single Node and their Counter component on a Distributed System(YARN + HDFS )
| Single Node | Distributed Systems | |
| Example | Linux Box | Hadoop Stack |
| File-system | extfs | HDFS |
| CPU | CPU and cores | CPU and Cores across multiple machine(YARN) |
| RAM | RAM on a single machine | RAM across multiple machines(YARN) |
| Process | Process | Process across multiple machine(YARN) |
[addToAppearHere]
- File-system
| Single Node | Distributed Systems | |
| File-system | extFS | HDFS |
| File | File is broken in tracks and sectors and saved on same machine | File is divided HDFS blocks . The blocks are saved across machines |
| Command to know where a file is saved | stat fileName | hadoop fsck /path/to/file -files -blocks |
| To enable Data recovery in case of failure | extfs provide journaling | maintains Replica of blocks |