Course 1 · Foundations of Blockchain and Cryptography
Module 5: Blockchain Data Structures, Nodes, and Network Topology
This final module examines how blockchain systems organise, propagate, and store data. We explore block structure, node roles, mempool behaviour, and the dynamics of a peer-to-peer network. Together, these elements define how decentralised ledgers maintain global coherence despite latency, partial failure, and high transaction volume.
Learning Outcomes
- Describe the structure of a block and the fields inside a block header.
- Understand how nodes maintain, propagate, and validate data.
- Explain mempool dynamics and how transactions flow through the network.
- Identify different node types: mining nodes, validator nodes, full nodes, light nodes.
- Analyse how peer-to-peer topology supports decentralisation and fault tolerance.
1. Block Structure
A blockchain block is a container for transactions and metadata. While specific block formats differ across protocols (Bitcoin, Ethereum, etc.), they share common elements:
- Block Header
- Block Body (Transactions)
- Metadata and Cryptographic Commitments
1.1 The Block Header
The block header contains the cryptographic commitments that define the block:
- Parent Hash: hash of the previous block
- Merkle Root: summarises all transactions
- Timestamp: when the block was mined/validated
- Nonce: for Proof-of-Work chains
- Difficulty Target: required to validate PoW
- State Root (Ethereum): summarises world state
1.2 Block Body
Contains the ordered list of valid transactions, along with optional metadata (receipts, logs).
2. Node Types and Roles
The blockchain network consists of heterogeneous nodes, each with distinct capabilities and responsibilities.
Store entire blockchain history and validate all blocks/transactions.
Store only block headers; use Merkle proofs for validation.
Propose new blocks using PoW or PoS mechanisms.
Store full history including intermediate states (e.g., Ethereum states).
3. Mempool and Transaction Propagation
Before a transaction is included in a block, it resides in the mempool — a node’s local cache of unconfirmed transactions. Transaction flow follows these steps:
- User signs and broadcasts a transaction.
- Nodes validate the transaction (signature, nonce, balance).
- The transaction enters the mempool.
- Validators/miners select transactions from the mempool.
- The transaction is included in a block.
- The block propagates across the network, confirming the transaction.
Every node maintains its own mempool; mempools are not globally synchronised.
4. Peer-to-Peer Network Topology
Blockchain networks are built on peer-to-peer (P2P) architectures where nodes connect directly to each other without central servers. Key aspects include:
- Gossip Protocols: messages propagate by nodes forwarding data to peers.
- Redundancy: multiple propagation paths reduce failure risk.
- Latency Variation: not all nodes receive data simultaneously.
- Topology Randomisation: prevents targeted attacks.
In a decentralised environment, P2P topology ensures robustness, censorship resistance, and persistence even under partial network partitioning.
5. State, Storage, and Pruning
Over time, blockchain data grows significantly. Different strategies address storage:
- Full History: nodes store every block and transaction.
- Pruned Nodes: keep recent blocks, discard old data safely.
- Archival Nodes: store entire state transitions (Ethereum).
- Light Clients: download only block headers.
Light clients use Merkle proofs to verify inclusion without storing full data — an essential concept from Module 2.
6. Synthesis
Blockchain is not a single structure but a layered architecture involving:
- Data Structures (blocks, Merkle trees, headers)
- Nodes (full, light, archival, validator)
- Transaction Flow (mempools, propagation)
- Network Topology (peer-to-peer, gossip protocols)
Together, these components enable decentralised agreement, data integrity, and resilient operation at global scale. This concludes Course 1, establishing the cryptographic and structural foundations for studying blockchain mechanics and smart-contract engineering in the next courses.
7. Key Terms
- Block Header
- Metadata containing cryptographic commitments.
- Mempool
- Cache of unconfirmed transactions at each node.
- Full Node
- Node that stores full chain and validates everything.
- P2P Network
- Network where nodes connect directly without central servers.
- Pruned Node
- Node that stores only recent blocks to save space.
8. Self-Check Quiz
- What is the purpose of the block header?
- How do full nodes differ from light nodes?
- Why are mempools not globally synchronised?
- Explain why gossip protocols support decentralisation.
- What is the advantage of using Merkle proofs for light clients?