with experience in tools such as: Big data tools: Hadoop, Spark, Kafka, etc. Relational SQL and NoSQL databases, including Postgres and Cassandra. Data pipeline and workflow management tools: Azkaban, Luigi, Airflow, etc. AWS cloud services: EC2, EMR, RDS, Redshift (or Azure equivalents) Data streaming systems: Storm, Spark-Streaming, etc. Search tools: Solr, Lucene, Elasticsearch Object-oriented/object function scripting More ❯
programming languages. Strong understanding of graph databases (e.g., RDF, Neo4j , GraphDB). Experience with data modeling and schema design. Knowledge of data pipeline tools and frameworks (e.g., Apache Airflow, Luigi). Excellent problem-solving and analytical skills. Ability to work independently and as part of a team. Clinical knowledge More ❯
Haskell Additional Qualifications: Experience working in a team environment for Git, GitLab, or GitHub Experience scaling data engineering across distributed computing clusters, including Apache Spark, Nifi, Dask, Airflow, or Luigi Experience with SQL and NoSQL database technologies such as Elasticsearch, Solr, HBase, Accumulo, Cassandra, Weaviate, ChromaDB, Pinecone, DuckDB, Neo4j, AWS DynamoDB, Redshift, Aurora, Oracle, PostgreSQL, MSSQL, MySQL, or MongoDB AWS More ❯
them from the ground up. Demonstrated understanding using software and tools including relational NoSQL and SQL databases including Cassandra and Postgres; workflow management and pipeline tools such as Airflow, Luigi and Azkaban; stream-processing systems like Spark-Streaming and Storm; and object function/object-oriented scripting languages including Scala, C++, Java and Python. Familiar with DevOps methodologies, including CI More ❯
e.g. Hadoop, Spark, Kafka, ElasticSearch Data Lakes: e.g. Delta Lake, Apache Hudi, Apache Iceberg Distributed Data Warehouse Frontends: e.g. Apache Hive, Presto Data pipeline and workflow management tools: e.g Luigi, Airflow Dashboard frontends: e.g. Grafana, Kibana Stream-processing systems: e.g. Storm, Spark-Streaming, etc. STR is a growing technology company with locations near Boston, MA, Arlington, VA, near Dayton, OH More ❯
degree in AI/ML, Data Science, Computer Science, or related field. Experience with LLMs, AI agents, NLP, or computer vision. Familiarity with distributed data processing (Spark, Dask, Airflow, Luigi). Hands-on experience with data labeling, curation, and model evaluation workflows. DoD 8140 IAT Level II certification (e.g., Security+ or CISSP). Experience in defense technology or AI-focused More ❯
knowledge of algorithms, design patterns, OOP, threading, multiprocessing, etc. Experience with SQL, NoSQL, or tick databases Experience working in a Unix environment and git Familiarity with Kafka, Docker, AirFlow, Luigi Strong communication skills in verbal and written English. Domain knowledge in futures & swaps is a plus Highly competitive compensation and bonus structure Meritocratic environment with ample opportunity for growth Blue More ❯
evaluation methodologies and key IR metrics Passion for shipping high-quality products and a self-motivated drive to take ownership of tasks Tech Stack Core : Python, FastAPI, asyncio, Airflow, Luigi, PySpark, Docker, LangGraph Data Stores : Vector Databases, DynamoDB, AWS S3, AWS RDS Cloud & MLOps : AWS, Databricks, Ray ️ Unlimited vacation time - we strongly encourage all of our employees take at least More ❯