Data Engineer

  • Tottenham Hotspur Football & Athletic Co Ltd
  • Tottenham, London, UK
  • 12 May, 2024
Full time Data Science Engineer Premier League

Job Description

Founded in 1882, Tottenham Hotspur Football Club is an English Premier League Club, based in North London.

 

Led by the late, great Bill Nicholson, the Club became the first in England to win the League and FA Cup Double in 1961, and the first in the UK to win a European trophy two years later. Spurs has since been home to some of the game’s great entertainers, including Jimmy Greaves, Glenn Hoddle, Paul Gascoigne, David Ginola, Gareth Bale, Heung-Min Son and Harry Kane.

 

In April 2019, the Club opened an iconic new stadium that sits at the heart of a £1 billion sport-led regeneration of North Tottenham. The stadium is the largest football club stadium in London and is a multi-use venue able to host events 365 days a year, including NFL, boxing, rugby, concerts, and other major events, plus visitor attractions including Stadium Tours and the Dare Skywalk.

 

The stadium development scheme has to date created more than 4,000 new jobs for local people, with circa £300m pumped into the local economy each year.

 

Tottenham Hotspur has:

 

  • A clear strategy to develop talent from within its Academy, showcased by a strong track record of Academy players graduating to the first-team squad.
  • A £100m state-of-the-art Training Centre that supports the Club’s ambition to attract, develop and retain the best talent.
  • Commercial partnerships with globally recognised brands including AIA Group Limited (AIA), one of the world's leading providers of life insurance services, and Nike, the world’s leading sports footwear and apparel company.
  • A commitment to minimising its environmental impact across Club operations, having been named the greenest club in the Premier League for the past three years. Tottenham Hotspur is a signatory of the UN Sports for Climate Action Framework, committing to halve carbon emissions by 2030 and become net zero carbon by 2040.
  • An award-winning Foundation that is renowned for creating opportunities to help enhance the lives of people in its local community through education, employment, health and social inclusion programmes.

 

The Football Insights Department provides data-derived insights that impact decision-making across football departments, from the First Team to the Academy and including both Men's and Women's teams. Our mission is to ensure that critical processes, from player recruitment to performance optimisation, are consistently informed by thorough, high-quality information. With a focus on building a single source of truth, developing rigorous quantitative models, and delivering effective tools for interrogating data, we aim to empower stakeholders with insightful statistical analyses that are both timely and actionable. Leveraging data as our fundamental commodity and powered by talent, we are looking to push the cutting edge of football analytics to drive the club towards its ambitious footballing vision.

 

JOB PURPOSE

 

  • To develop and maintain robust, scalable data infrastructure and ETL pipelines that underpin the club's analytics platform, ensuring data from diverse sources is accurately ingested, processed, and made readily available for analysis across all football departments.
  • To continually enhance our technical infrastructure through innovative engineering solutions, focusing on the scalability and reliability of systems that support the club’s data platforms, ensuring they are well-integrated and capable of handling evolving analytical demands.
  • To collaborate effectively with Data Scientists and Analysts to support the production and integration of advanced machine learning models, facilitating the seamless transition of bespoke metrics into insights on player and team performance.

 

KEY RESPONSIBILITIES

 

  • Automate data processes to streamline club operations, identifying and addressing bottlenecks to improve data throughput and quality.
  • Design, develop, and maintain scalable ETL pipelines and data architectures to ensure data accuracy, quality, and security across internal and external sources.
  • Ingest data from heterogeneous sources and build validations to assess data quality.
  • Manage data transformation across multiple layers and incorporate data modelling for analytical purposes (e.g., dimensional and semantic), ensuring data is accurately processed, validated, and enriched to meet analytical requirements and support decision-making processes.
  • Collaborate closely with Data Scientists to productionise machine learning models, integrating bespoke metrics into cleaned and merged datasets to drive strategic insights.
  • Implement and maintain CI/CD pipelines for testing, deployment, and version control in tools like dbt, ensuring the consistent application of best practices in code management.
  • Collaborate with the club's Technology Department to leverage and evolve a modern tech stack, ensuring the seamless integration of data analytics capabilities with the club's broader technological infrastructure.
  • Provide prompt support for time-sensitive data infrastructure issues, ensuring operational continuity during critical periods like match analysis cycles.
  • Uphold our technical best practices and standards, mentoring staff on good programming practices and coding literacy.
  • Maintain rigorous documentation practices to ensure clarity, accuracy, and consistency in data management and pipeline development.
  • Manage key relationships with data vendors and technology partners, ensuring access to timely and relevant data that supports the club’s processes.
  • Contribute to the expansion of the Football Insights department by identifying areas for growth and shaping its future direction and capabilities.
  • Play a pivotal role in the recruitment and onboarding process for data/analytics engineering talent, from identifying potential candidates to advising on hiring decisions, with a focus on building a diverse and skilled team.

 

PERSON SPECIFICATION

 

SKILLS & COMPETENCIES

Essential

  • Master’s or higher degree in a quantitative field (Mathematics, Statistics, Computer Science, or related fields) or equivalent experience working in the software/data industry.
  • 3+ years as a Data/Analytics/Infrastructure/ML Engineer or combined experience with a similar role in the data industry (e.g., Data Scientist).
  • Advanced proficiency in statistical programming languages, especially Python and/or R.
  • Proficient in SQL development and knowledgeable about database and data warehousing technologies.
  • Solid experience with cloud services from major providers (e.g., Azure, AWS, Google Cloud), with a comprehensive understanding of cloud-based data solutions and best practices.
  • Experience with major data warehousing solutions (e.g., Snowflake, BigQuery, Redshift), including their architecture and data modelling capabilities.
  • Proficiency in implementing and managing medallion architecture within cloud data platforms, demonstrating a strong understanding of layer segregation from raw to validated and enriched stages.
  • Extensive experience in integrating diverse data sources using APIs.
  • Experience processing and storing large “out-of-memory” datasets, for example using distributed data processing technologies (e.g., Apache Spark) and columnar storage formats.
  • Ability to communicate effectively and with a high degree of empathy to elicit requirements and deliver solutions to stakeholders who may have varying technical backgrounds.

 

Desirable

  • In-depth knowledge of Azure's cloud services and architecture.
  • Experience with Azure Data Factory (ADF) for data ingestion and workflow orchestration.
  • Proficiency in Blob Storage for data storage solutions.
  • Experience using dbt (Data Build Tool) for SQL-based data transformation and modelling.
  • Experience with Snowflake for data warehousing, including its architecture and data modelling capabilities.
  • Strong foundations in software engineering best practices, including version control (e.g., Git), automated testing, and CI/CD processes.
  • Broad working knowledge of BI tools, preferably Tableau, to support analytics use cases.
  • Experience with football data providers, specifically StatsBomb and SkillCorner, for advanced data analysis.

 

PERSONAL ATTRIBUTES

 

  • Passionate about football, with a keen appreciation for the dynamics and pace of elite sports.
  • Demonstrates innovative thinking and exceptional problem-solving skills, consistently pushing the boundaries in data engineering practices.
  • A collaborative team player, committed to building effective relationships and achieving collective success within a multidisciplinary team.
  • Possesses a growth mindset, actively seeking to enhance personal skills and stay updated with the latest advancements in data engineering.
  • Proactively manages project timelines and workload, especially during critical periods of the football season, ensuring timely delivery of solutions.
  • Ambitious and competitive, striving to leverage cutting-edge data engineering techniques to give the club a competitive advantage on and off the pitch.

 

Safeguarding is fundamental to success in all that we do. Successful candidates are subject to an Enhanced DBS check, including a check of the children’s barred list.

 

Tottenham Hotspur Football Club welcomes applications from anyone regardless of age, disability, race, or ethnic and national origins, religion or belief, or sexual orientation.