Keyboard shortcuts

Press or to navigate between chapters

Press S or / to search in the book

Press ? to show this help

Press Esc to hide this help

 

Introduction

Protocols are standardized rules that govern how data is transmitted, formatted, and processed across systems.

In Big Data, protocols are essential for:

  • Data ingestion (getting data in)
  • Inter-node communication in clusters
  • Remote access to APIs/services
  • Serialization of structured data
  • Security and authorization
ProtocolLayerUse Case Example
HTTP/HTTPSApplicationREST API for ingesting external data
KafkaMessagingStream processing with Spark or Flink
gRPCRPCMicroservices in ML model serving
MQTTMessagingIoT data push to cloud
Avro/ProtoSerializationBinary data for logs and schema
OAuth/KerberosSecuritySecure access to data lakes