record of building and managing real-time data pipelines across multiple initiatives. Expertise in developing data backbones using distributed streaming platforms (Kafka, Spark Streaming, Flink, etc.). Experience working with cloud platforms such as AWS, GCP, or Azure for real …
in implementing cloud-based data solutions using AWS services such as EC2, S3, EKS, Lambda, API Gateway, and Glue, and big data tools like Spark, EMR, Hadoop, etc. Hands-on experience in data profiling, data modeling, and data engineering using relational databases like Snowflake, Oracle, SQL Server; ETL tools … like Informatica IICS; scripting using Python, R, or Scala; workflow management tools like Autosys. Experience with stream processing systems like Kafka, Spark Streaming, etc. Experience in Java, JMS, SOAP, REST, JSON, and XML technologies, along with Unix or Linux scripting. Implementation experience of DevOps CI/CD …
Annapolis Junction, Maryland, United States Hybrid / WFH Options
SRC
edge solutions in Big Data, Data Science, and Cloud Computing for both government and commercial clients. Dive deeper with cutting-edge tech like Spark, AWS, Azure, Cloudera, Kubernetes, and Google Cloud to build impactful solutions and gain real-time insights with Spark Streaming and …
unit testing, version control, code review). Big data ecosystems: Cloudera/Hortonworks, AWS EMR, GCP Dataproc, or GCP Cloud Data Fusion. Streaming technologies and processing engines: Kinesis, Kafka, Pub/Sub, and Spark Streaming. Experience working with CI/CD technologies: Git, Jenkins …