Role Summary
As a Cloud-Based Engineer, you will architect, deploy, and manage the cloud and edge-to-cloud infrastructure that underpins the safety and observability of our Collision Avoidance Systems (CAS). Working at the intersection of embedded systems and big data, you will own the telemetry pipelines, device management, and cloud ingestion paths that keep heavy mobile equipment operating safely. You will modernise LTE/MQTT communication paths and build scalable APIs as we transition hardware to industrial SBCs, ensuring the system remains high-performing and fail-safe.
Key Outcomes
- Resilient Ingestion: Maintain low-latency data paths via LTE/4G using optimised MQTT policies for safety and bandwidth efficiency.
- API Excellence: Deliver production-grade APIs and data models for cross-team use in analytics, simulations, and safety case development.
- Full-Stack Observability: Implement comprehensive monitoring (metrics, traces, logs) and fleet health dashboards with actionable SLOs.
- Hardened Security: Manage secure device identity, provisioning, and lifecycle management (IEC62443) with a focus on least-privilege access.
- Data Integrity: Establish retention policies that provide robust evidence for safety analysis and regulatory compliance.
Responsibilities
- Cloud-Native Design: Build cloud services for CAS telemetry and device messaging using MQTT, REST, and gRPC.
- Pipeline Engineering: Develop streaming and batch data pipelines to process field logs and metrics for fleet-wide analytics.
- Architecture Collaboration: Partner with Principal Architects to analyse and visualise data for new feature development.
- Contract Management: Define and protect message schemas across onboard, roadside, and cloud boundaries to ensure backward compatibility.
- Scalable Storage: Operate high-performance storage layers (Time-series, Object, Relational) with cost-effective indexing and lifecycle policies.
- Observability Standards: Lead the implementation of OpenTelemetry, centralised logging, and on-call readiness for cloud services.
- Security & IAM: Manage PKI, certificate rotations, and role-based access control (RBAC) for all internal dashboards and services.
- Safety Validation: Collaborate with embedded engineers to translate safety requirements into cloud-side watchdogs and command rate limits.
- Operational Readiness: Create interface specifications, deployment runbooks, and CI/CD pipelines using Infrastructure-as-Code (Terraform).
Your Technical Profile
- Cloud & IoT: 3+ years in cloud platform engineering for IoT or safety-critical systems.
- Connectivity Mastery: Deep expertise in MQTT (QoS, session state), device identity, and cellular networking (LTE/4G).
- Backend Development: Strong coding skills in C#, Go, Python, or Java with experience in message brokers and event streaming.
- Infrastructure as Code: Expert knowledge of AWS, Azure, or GCP, including Docker, Kubernetes (K8s), and Terraform.
- Observability Tooling: Proficiency with Prometheus, OpenTelemetry, and log pipeline management.
- Data Engineering: Skilled in schema design (Protobuf, Avro, JSON) and time-series data governance.
- Security-by-Design: Experience with PKI, secrets management, and threat modelling for device-to-cloud interactions.
Highly Regarded Extra Strengths:
- Front-end development skills in ReactJS, TypeScript, or Figma.
- Experience with V2X/DSRC telemetry and spatiotemporal data visualisation.
- Familiarity with Digital Twins, replay frameworks, and GNSS data handling.
- Knowledge of RS-232 integration and control system observability.
DRD Group is a specialist recruitment provider renowned for recruiting quality project teams on major projects within the Energy, Resources, Defence, Renewables and Infrastructure sectors.
DRD Group embraces a diverse workforce culture and encourages applicants from all backgrounds to apply. All work assignments are subject to successful completion of a medical, drug and alcohol screen plus verification of all original certifications and qualifications.
#SCR-fab-frascari
