Data Engineer (Remote)
Location: EU
Plants of Tomorrow (PoT) is a strategic program of our client, a Swiss multinational, with the goal of empowering their business to increase productivity, reduce costs and gain insights for new innovation by using Artificial Intelligence and Machine Learning on site and in real time.
Best of all, it enables their plants to leverage their existing systems to host both plant-specific applications and the new range of PoT applications.
As part of the PoT MLOps team, you will have the opportunity to manage the complex challenges of scale that are unique to PoT, using your expertise in coding, algorithms, complexity analysis and large-scale system design while keeping a results-oriented approach.
The PoT MLOps culture of diversity, intellectual curiosity, problem solving, openness and innovation oriented towards solving business problems is key to its success. Our team brings together people with a wide variety of backgrounds, experiences and perspectives.
We encourage them to collaborate, think big and take risks in a blame-free environment.
We promote self-direction to work on meaningful projects, while we also strive to create an environment that provides the support and mentorship needed to learn and grow.
Key Responsibilities:
- Design and implement comprehensive data analytics solutions from start to finish, encompassing data modeling, data integration, and automation of various processes.
- Design, create and maintain optimal data pipeline software.
- Extract and assemble large, complex data sets to be consumed directly in machine learning models or analytics applications.
- Proactively enhance and sustain the current data platform and ecosystem through actions such as system configuration, performance optimization, monitoring of data pipelines, and providing operational support to users.
Key Values:
- Self-driven, autonomous, results-oriented person
- Enjoys solving business and technical challenges
- Positive and joyful attitude, even under stress
- Analytical and practical mindset
- Curiosity to explore, to learn new things and to challenge existing understandings
- Design solutions considering the context, the end result and all the intermediate elements
- Build solutions to be reliable, secure, sustainable and performant while remaining pragmatic in achieving the intermediate objectives
- Courage to take risks, openness to admit mistakes, and the ability to move forward by learning from them
- Perseverance in the face of setbacks
Key Skills:
- Expert knowledge of SQL, with the capacity to efficiently extract and collate meaningful information from high-volume sources and structure it into a comprehensive and usable data set.
- Expertise in programming, with proficiency in at least one major general-purpose programming language, such as Java, C++, C#, or Python (preferred).
- Good understanding of both relational and NoSQL databases, big data platforms, and principles of database and data warehouse modeling.
- Strong analytical skills related to working with unstructured datasets.
- Ability to build processes supporting data collection, cleansing and transformation, data structures, metadata, dependency and workload management.
- Working knowledge of message queuing, stream processing and highly scalable ‘big data’ data stores.
- Strong project management and organizational skills.
- Experience supporting and working with cross-functional teams in a dynamic environment.
- We are looking for a candidate with 5+ years of experience in a Data Engineer, DWH/ETL developer, BI engineer or similar analytics development role, who has attained a Graduate degree in Computer Science, Statistics, Informatics, Information Systems or another quantitative field. They should also have experience using the following software/tools:
- Experience with object-oriented/object function scripting languages: Python, Java, C++, Scala, etc.
- Experience with big data tools
- Experience with relational SQL and NoSQL databases
- Experience working in an MLOps setup, deploying and scaling multiple products.
- Experience with major cloud data pipeline services such as AWS (EC2, EMR, RDS, Redshift, etc.) or GCP (DataFlow, DataPrep, BigQuery, GCS, etc.)
- Advanced/bilingual English is essential; interviews will be conducted in this language. Other languages such as French or German are valuable.
Nice to have:
- Advanced skills in Python
- Experience with data pipeline and workflow management tools: Luigi, Airflow, etc.
- Experience with stream-processing systems: Storm, Spark-Streaming, etc.
- Experience with industrial data protocols like OPC DA / OPC UA
We offer:
- A professional career where you set the limits; we will give you room and encourage you to grow
- Gross remuneration to be negotiated based on the expertise you bring; we will not rule out any professional whose valuation is justified.
- 100% remote work, plus additional social benefits that we will explain to you if you become our candidate.