Creditvidya
Home
Careers
About
  • Read Spark Parquet Output Using Python Job Using Arrow

    Jun 10, 2021 · 2 min read · Hadoop BigData Apache Spark Apache Arrow Python Parquet Pandas Dataframe  ·

    We have a lot of different jobs which run on our Spark Clusters and produce the output in different formats as per the job specifications. Generally the data format that we use are json, parquet and parquet with data partitioning. There is this one case where we wanted the paquet output data generated by one of the …

    Read More

Featured Posts

  • Encoding vs Encryption vs Hashing vs Obfuscation

Recent Posts

  • Read Spark Parquet Output Using Python Job Using Arrow
  • Encoding vs Encryption vs Hashing vs Obfuscation
  • How The Pandemic Made Infrastructure Leaner

Categories

ENGINEERING 2 TECHNOLOGY 2 DATA-ANALYTICS 1 DATA-ENGINEERING 1 DATA-PROCESSING 1 DEVOPS 1 SECURITY 1

Tags

APACHE-ARROW 1 APACHE-SPARK 1 AWS 1 BIGDATA 1 CLOUD 1 CLOUDFORMATION 1 COST-REDUCTION 1 DATAFRAME 1 EBS 1 ENCODING 1 ENCRYPTION 1 HADOOP 1 HASHING 1 OBFUSCATION 1
All Tags
APACHE-ARROW1 APACHE-SPARK1 AWS1 BIGDATA1 CLOUD1 CLOUDFORMATION1 COST-REDUCTION1 DATAFRAME1 EBS1 ENCODING1 ENCRYPTION1 HADOOP1 HASHING1 OBFUSCATION1 PANDAS1 PARQUET1 PASSWORDS1 PYTHON1 S31 SECURITY1 SPOT1
[A~Z][0~9]
Creditvidya

Copyright  CREDITVIDYA. All Rights Reserved