Mandatory Skills: Azure Data Factory, Azure Kubernetes, Log Analytics & Azure Monitoring, Azure Data Bricks, Azure data Lake, Azure Storage, Scala/Spark Framework, SnowFlake
Minimum 3 years of experience in below Azure services is must. Azure Data Factory b. Azure Data Bricks c. Azure Data Lake d. Azure StorageMore than 2 years of programing experience in Scala/Spark framework .Good Database designing and development skills and having experience in Snowflake DB.Real time experience in working in a challenging system as dealing with large volume of data.Working experience in Bigdata is added advantage
Designing and implementing highly performant data ingestion pipelines from multiple sources using Apache Spark and/or Azure DatabricksDelivering and presenting proofs of concept to of key technology components to project stakeholders.Developing scalable and re-usable frameworks for ingesting of geospatial data setsIntegrating the end to end data pipleline to take data from source systems to target data repositories ensuring the quality and consistency of data is maintained at all timesWorking with event based / streaming technologies to ingest and process dataWorking with other members of the project team to support delivery of additional project components (API interfaces, Search)Evaluating the performance and applicability of multiple tools against customer requirementsWorking within an Agile delivery / DevOps methodology to deliver proof of concept and production implementation in iterative sprints.
Strong knowledge of Data Management principlesExperience in building ETL / data warehouse transformation processesDirect experience of building data piplines using Azure Data Factory and Apache Spark (preferably Databricks).Experience using geospatial frameworks on Apache Spark and associated design and development patternsMicrosoft Azure Big Data Architecture certification.Hands on experience designing and delivering solutions using the Azure Data Analytics platform (Cortana Intelligence Platform) including Azure Storage, Azure SQL Data Warehouse, Azure Data Lake, Azure Cosmos DB, Azure Stream AnalyticsExperience with Open Source non-relational / NoSQL data repositories (incl. SnowFlake, MongoDB, Cassandra, Neo4J)Experience working with structured and unstructured data including imaging & geospatial data.Experience working in a Dev/Ops environment with tools such as Jenkins