Meet Alexis Balayre
Who am I?
Name: Alexis Balayre
Nationality: French
Location: Paris, France
Degrees: ISEP Engineering Master's Degree | MSc in Computational and Software Techniques in Engineering, Cranfield University
Specialisations: Real-Time Speech AI | Generative AI | Agentic Systems | Machine Learning | Software Engineering
Role: AI Engineer
AI Engineer specialising in real-time speech AI, with a dual background in Software Engineering and Data Science. I design, build, and operate production systems that turn applied research into scalable products, currently focused on low-latency speech translation at Acolad. Beyond speech, I work across agentic systems, LLM and RAG applications, and applied machine learning, with a track record of taking prototypes into production. I'm particularly passionate about AI security, which I see as one of the most critical challenges ahead. My goal is to build AI that is not only capable, but secure and trustworthy for the people and organisations who depend on it.
Experiences

AI Engineer at Acolad
Sept. 2025 - Present
Acolad is a global leader in language and content solutions, present in 25 countries. As an AI Engineer, I work on Lia Live AI, our real-time AI interpreting platform, owning features end to end from applied speech research to production infrastructure.
- Build real-time speech translation pipelines (streaming ASR, LLM translation, streaming TTS) with sub-second latency across 30+ languages.
- Engineer the infrastructure behind live multilingual sessions over WebRTC and telephony, with failover on Kubernetes for zero-downtime scaling.
- Develop agentic features that surface clarifications and cultural context, grounding translations in client glossaries and reference documents.
- Benchmark speech AI providers on quality, latency, and cost via purpose-built evaluation harnesses, driving per-language production routing.
- Run the service in production: incident diagnosis, observability, security hardening, and provider cost optimisation.
- Advance the platform's next generation: speech model fine-tuning, inference optimisation, and on-device deployment.
Data Scientist at Dassault Systèmes
Jan. 2025 - Jul. 2025
Dassault Systèmes, the 3DEXPERIENCE Company, empowers businesses and individuals with virtual universes to foster sustainable innovation. As a Data Scientist, I worked in the Industry Market Intelligence - Corporate Strategic Planning team to automate and optimise data processes, ultimately driving strategic insights and enabling informed decision-making.
- Accelerated strategic reporting and Market Intelligence by automating data workflows and analysing large-scale datasets (millions of records) using Python and SQL.
- Designed and implemented data-driven solutions leveraging Web Scraping, Natural Language Processing (topic modelling, sentiment analysis), and Generative AI (LLM, RAG, GraphRAG) to extract insights from both structured and unstructured data.
- Built and maintained Competitive Intelligence and monitoring tools to support executive decision-making and strengthen Dassault Systèmes' positioning in its Business sectors.
Freelance Data Engineer at Sidetrade
Oct. 2024 - Nov. 2024
Sidetrade is a leading AI-powered SaaS platform that helps companies enhance working capital and cash flow performance by optimising customer relationship management and automating financial processes.
- Defined the architecture and data workflows for an automated carbon footprint reporting solution, aimed at reducing manual work and improving reporting speed.
- Designed the integration of data from NetSuite, Egencia, Eurécia, and Navan into Power BI to enable dynamic, real-time dashboards with exportable insights.
- Proposed an ETL solution using Microsoft Fabric to streamline the processing of both structured and unstructured data.
- The project was discontinued during implementation due to GDPR-related data privacy constraints.
Research Student at Cranfield University
May 2024 - Sept. 2024
Master's Thesis in collaboration with Airbus, supported by UK Research and Innovation (UKRI) and the Aerospace Technology Institute (ATI), focused on enhancing the efficiency and accuracy of automated ground refuelling systems for Smart Airports. Developed a machine learning framework combining the YOLOv10 object detection model with a proposed deep learning sequence model (SizPos-GRU) to predict the future position of a commercial aircraft's refuelling port in a video stream.

Vice President of the Blockchain Lab at Garage ISEP
Sept. 2022 - Sept. 2023
Garage ISEP is the ISEP student association dedicated to technology (Blockchain, Artificial Intelligence, ...). Organised workshops and conferences to help students learn and master Blockchain technologies.

Blockchain Developer at CoinShares
Sept. 2022 - Jan. 2023
CoinShares is Europe's largest and oldest crypto asset investment company, managing billions of dollars in assets. Worked within the DeFi team to build a decentralised asset management tool, gaining hands-on experience in Decentralised Finance.
Projects
Claude Code Power Config: Agentic Development Framework
An opinionated, production-ready configuration framework for Claude Code that codifies a multi-agent, worktree-first development workflow. It bundles always-on project memory (CLAUDE.md), path-scoped convention rules, auto-discoverable skills, a multi-agent PR review command, isolated domain subagents (conventions, migrations, security, architecture), and zero-LLM shell hooks for git safety and quality gates. It is language-agnostic and demonstrated on a TypeScript monorepo. It reflects how I now plan and build software: planning with Gemini and Claude, then driving implementation with Claude Code.
AuraHelpdeskGraph: GraphRAG Support Chatbot
A support chatbot that leverages local LLMs, vector similarity search, and knowledge graphs to provide contextual assistance by finding and presenting solutions from historical support tickets.
RagDocs: RAG-Powered Chatbot for Technical Documentation
RagDocs is an open-source tool for querying technical documentation using local LLMs. It indexes documents into a Milvus vector store and answers questions with Retrieval-Augmented Generation (RAG), so documentation stays private and queries run without external API costs. Built with Next.js, FastAPI, LlamaIndex, and Ollama, and deployable via Docker.
Master's Thesis: Future Position Prediction for Pressure Refuelling Port of Commercial Aircraft
This thesis develops a deep learning framework to predict the future position of aircraft refuelling ports, enhancing automated refuelling systems. The approach integrates a fine-tuned YOLOv10 model for detection with the proposed 'SizPos-GRU' sequence model, which captures temporal and spatial relationships from video frames using an encoder-attention-decoder architecture. Results show that the SizPos-GRU model outperforms the other models evaluated, achieving an Average Displacement Error (ADE) of 4.28% and a Final Displacement Error (FDE) of 9.18% when using 30 past frames to predict 60 future frames. When using 15 past frames to predict 30 future frames, the model achieved an ADE of 2.15% and an FDE of 4.83%. These results demonstrate the framework’s effectiveness in automating and enhancing aircraft refuelling operations.
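For readers unfamiliar with the two metrics quoted above, the following is a minimal sketch of how ADE and FDE are commonly computed for a single predicted trajectory. It is not taken from the thesis code: the function name `displacement_errors` and the sample coordinates are hypothetical, and the way the thesis expresses these errors as percentages (its normalisation) is not reproduced here.

```python
import numpy as np


def displacement_errors(predicted, actual):
    """Return (ADE, FDE) for one trajectory of shape (num_frames, 2).

    ADE is the mean Euclidean distance over all predicted frames;
    FDE is the Euclidean distance at the last predicted frame.
    Hypothetical helper for illustration, not the thesis implementation.
    """
    predicted = np.asarray(predicted, dtype=float)
    actual = np.asarray(actual, dtype=float)
    # Per-frame Euclidean distance between predicted and true positions.
    distances = np.linalg.norm(predicted - actual, axis=-1)
    return float(distances.mean()), float(distances[-1])


# Made-up positions for three future frames (illustrative only).
predicted = [[0.0, 0.0], [3.0, 4.0], [6.0, 8.0]]
actual = [[0.0, 0.0], [0.0, 0.0], [0.0, 0.0]]
ade, fde = displacement_errors(predicted, actual)
print(ade, fde)
```

Because the error typically grows with the prediction horizon, FDE is usually larger than ADE, which matches the pattern in the figures reported above.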
GPTAggregator
GPTAggregator is a Python-based application that provides a unified interface for interacting with various large language models (LLMs) via their respective APIs. The project is designed to be user-friendly and easily extensible, making it useful for developers, researchers, and anyone interested in exploring the capabilities of large language models. GPTAggregator makes it possible to switch seamlessly between models within the same conversation, centralise conversation storage, and automatically optimise messages.
AI-Powered Meeting Summarizer
The AI-Powered Meeting Summarizer is a Gradio-powered application that converts audio recordings of meetings into transcripts and provides concise summaries using whisper.cpp for audio-to-text conversion and Ollama for text summarisation. This tool is ideal for quickly extracting key points, decisions, and action items from meetings.
Group Project: Deep Learning for Turbulence Modelling
This project integrates machine learning with computational fluid dynamics (CFD) to address the limitations of traditional turbulence models. By incorporating Physics-Informed Neural Networks (PINNs) and employing the Sparse Identification of Nonlinear Dynamical Systems (SINDy) approach via the PySINDy library, we aim to improve the accuracy and efficiency of turbulent flow simulations. This interdisciplinary effort also draws on high-performance computing to advance the predictive modelling of fluid dynamics.
Chest X-Ray Abnormalities Detection with Faster R-CNN
This project leverages the Faster R-CNN model with a ResNet-50 backbone, implemented using PyTorch Lightning, for the detection and localisation of thoracic abnormalities in chest X-ray images.
COVID-19 Data Analysis Project Using Apache Spark
This project harnesses Big Data and Machine Learning technologies to analyse the global impact of the COVID-19 pandemic. Utilising Apache Spark, it processes extensive datasets of COVID-19 cases, uncovering insights into the virus's spread and effects.
Cloud Computing and IoT for Environmental Monitoring Project
This project integrates Cloud Computing with the Internet of Things (IoT) for comprehensive environmental monitoring, with a focus on air quality analysis. Utilising Apache Spark and Amazon Timestream, it manages large volumes of data from IoT sensors to calculate the Air Quality Index (AQI) accurately, offering insights into environmental conditions.
PriceSense: Correlating News Sentiment with Stock Prices
This project examines the relationship between financial news sentiment and stock market prices using two data pipelines. The first pipeline uses LocalStack to emulate AWS services, including Amazon S3, for data storage and messaging, while the second uses Apache Kafka for real-time streaming and message exchange. Both pipelines process financial news and stock price data, calculate sentiment scores, merge the two datasets, and then index the result into Elasticsearch for analysis and visualisation.
Skills
Programming Languages
Generative AI & LLMs
Speech & Real-Time AI
AI & Data Intelligence
AI Dev Workflow
Databases
Data Engineering Tools
Frontend
Backend
Hosting & Cloud Services
Server Management & DevOps
Other Tools
Hackathons

ETHGlobal ETHOnline 2023 Winner
Oct. 2023
ETHOnline is a three-week online hackathon with over $200,000 USD in prizes, alongside a series of curated summits celebrating some of the most significant happenings and learnings from the past 12 months in web3. Enigma introduces the Secret NFTs standard, built on a dual-URI approach: each NFT has a public layer and a private layer, and only the owner has full access. From art to memberships, it prioritises privacy in the digital domain. Enigma won 2 prizes at the 2023 ETHGlobal ETHOnline hackathon: "The Graph - Best New Subgraph or Substream" and "Scroll - Pool Prize".

Chainlink Spring 2022 Winner
May 2022
Hackathon to build the next generation of Web3 apps with $500K+ in prizes. Winner of Chainlink Top Quality Projects.

ETHGlobal HackMoney 2022 Winner
May 2022
ETHGlobal HackMoney is the world's largest DeFi hackathon. Sleepn is a decentralised sleep tracking application that allows you to earn money by sleeping. Sleepn won 2 prizes at the 2022 ETHGlobal HackMoney hackathon: "Polygon - Best App" and "Superfluid - Best Use With Another Partner".

Finalist in Devoteam's Cyber Hackathon
Oct. 2021 - Nov. 2021
Finalist, out of more than 3,800 entrants, in the 'Devogame' cybersecurity competition organised by Devoteam. Proposed a system for verifying a company's logs using a private blockchain and a consensus system based on private keys.
Education

Cranfield University - MSc in Computational and Software Techniques in Engineering
Sept. 2023 - Sept. 2024
MSc at Cranfield University as part of a double degree programme with ISEP.
Cranfield University is a public institution in the UK and the country's only university dedicated exclusively to postgraduate education and research. It specialises in science, engineering, technology, and management.
Modules studied:
- Machine Learning and Big Data
- Artificial Intelligence
- High Performance Technical Computing
- Cloud Computing
- Small-Scale Parallel Programming
- Management for Technology
ISEP - Engineering Master's Degree
2021 - 2024
Engineering cycle at ISEP in Paris. Specialisation in Data Intelligence.
ISEP is a French digital engineering school located in Paris. It trains general engineers in the key fields of digital technology: IT & Cybersecurity - Electronics & Robotics - Telecommunications & Internet of Things - Imaging - Artificial Intelligence.
CVUT - Academic Semester Abroad
Feb. 2021 - Jun. 2021
Academic semester in English at the Czech Technical University in Prague.
Modules studied:
- Telecommunication Systems and Networks
- Introduction to Cyber Security
- Environmental Engineering
- Discrete Mathematics
- Programming
- English
ISEP - International Integrated Cycle (CII)
2019 - 2021
Preparatory years at the ISEP engineering school in Paris.
The CII follows a programme specific to ISEP, balanced across three teaching areas: Fundamental Digital Sciences | Techniques & Technologies | Languages, Cultures, Humanities and Sustainable Development. It includes an international semester from the second year.
