Sudheer Keshav Bhat · May 16, 2023
cd <source_dir>
zip -r sources.zip .
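Running zip from inside the source directory matters because it puts packages at the root of the archive, and Spark places --py-files archives on the PYTHONPATH. The sketch below imitates that behaviour locally with plain Python. The package name mypkg, the module helpers and both function names are hypothetical, chosen only for illustration.

```python
import importlib
import sys
import tempfile
import zipfile
from pathlib import Path


def build_demo_zip(zip_path):
    """Create a zip with a package at its root, similar in layout to
    running `zip -r sources.zip .` inside the source directory."""
    with zipfile.ZipFile(zip_path, "w") as zf:
        # Hypothetical package and module, for illustration only.
        zf.writestr("mypkg/__init__.py", "")
        zf.writestr(
            "mypkg/helpers.py",
            "def greet():\n    return 'hello from zip'\n",
        )


def import_from_zip(zip_path, module_name):
    """Put the archive on sys.path and import a module from it."""
    sys.path.insert(0, str(zip_path))
    importlib.invalidate_caches()
    return importlib.import_module(module_name)


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        archive = Path(tmp) / "sources.zip"
        build_demo_zip(archive)
        helpers = import_from_zip(archive, "mypkg.helpers")
        print(helpers.greet())
```

If the archive were built from the parent directory instead, the package would sit one level deeper and `import mypkg` would likely fail.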
Define environment variables with the spark.yarn.appMasterEnv. prefix, like below, and pass them to Spark via the --properties-file param:
spark.yarn.appMasterEnv.SEARCH_URL=https://search.domain.com
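On the Python side, a variable set this way should show up as an ordinary environment variable in the driver, since in yarn cluster mode the driver runs inside the application master. A minimal sketch follows. The helper name get_search_url and its default argument are assumptions for illustration.

```python
import os


def get_search_url(default=None):
    """Return SEARCH_URL from the process environment.

    Assumes the job runs in yarn cluster mode, where the driver lives in
    the application master and sees spark.yarn.appMasterEnv.* variables.
    """
    return os.environ.get("SEARCH_URL", default)


if __name__ == "__main__":
    # Falls back to a placeholder when the variable is not set.
    print(get_search_url(default="<SEARCH_URL not set>"))
```

Executors are separate processes and do not inherit this variable. If code running on executors needs it, the spark.executorEnv. prefix is the corresponding setting.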
Pass additional files, such as JSON configs, via the --files param:
spark-submit --master yarn \
--deploy-mode cluster \
--packages org.apache.hadoop:hadoop-aws:3.3.1,com.amazonaws:aws-java-sdk-emr:1.12.468 \
--properties-file .env \
--files config1.json,config2.json \
--py-files s3://bucket/path/sources.zip \
spark_main.py arg1 arg2
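Inside spark_main.py, the files shipped with --files and the trailing arguments can then be picked up roughly as sketched here. This assumes YARN has localized config1.json and config2.json into the container's working directory, which is the usual behaviour in cluster mode. The function load_config and its base_dir parameter are hypothetical.

```python
import json
import sys
from pathlib import Path


def load_config(name, base_dir="."):
    """Load a JSON file that was shipped with --files.

    Assumes the file sits in the current working directory of the
    container; base_dir exists only to make local testing easier.
    """
    path = Path(base_dir) / name
    with path.open(encoding="utf-8") as fh:
        return json.load(fh)


def main(argv):
    # argv[0] is the script name; the rest are arg1, arg2 from spark-submit.
    app_args = argv[1:]
    config1 = load_config("config1.json")
    config2 = load_config("config2.json")
    print(app_args)
    print(config1)
    print(config2)


if __name__ == "__main__":
    main(sys.argv)
```

Relative paths keep the script independent of the node it lands on. If the files are not found this way, pyspark's SparkFiles.get(name) is another way to resolve files distributed with the job.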