alter-text

Lidija Jovanovska

Senior Machine Learning Engineer

About Me

I’m an engineer specialising in building and deploying ML solutions. Since publishing my first research paper in 2019, I’ve been driven by a curiosity to solve complex problems and push the boundaries of data science and AI.

Currently, I’m part of the innovative team at Sportradar, where I’m building generative models for sports. Using transformer-based models we were able to model the game of basketball via a custom grammar. This helps us generate new, unseen games and predict what will happen next for real games. Previously, I built an NLP-driven solution for entity mapping and semantic search which significantly improved mapping accuracy for our data products. I’ve also dabbled with agents - building the football commentary agent used in Google’s timeline overview.

I thrive on the intersection of innovation and impact. Whether it’s deploying machine learning models into production or fine-tuning language models for semantic annotation, I believe in the power of data to transform industries. I’m always eager to learn, explore new techniques, and bring creative solutions to the table. And if you ever want to chat AI, NLP, or how to make machines understand us a little better, feel free to reach out!

Achievement highlights

Video thumbnail for shot probability on YouTube
Shot probability feature video

🔖 Peacock Performance View feature now available for all NBA games on Peacock

Google timeline view showing football commentary
Google timeline view with football commentary integration

🔖 Google match timeline (Liverpool vs Galatasaray)

Work Experience

Sep 2022 - Present · Ljubljana, Slovenia

Key responsibilities
  • Design and deploy models for generative sports gameplay modelling and event prediction
  • Develop and implement NLP solutions for entity mapping and semantic search
  • Collaborate with cross-functional teams to integrate ML models into production systems
  • Build GenAI products for sport (e.g., match commentary, sports insights)

Technologies

  • Python, Kubernetes, FastAPI, Docker, SQL, Kafka, AWS (Redshift, S3, Athena, Sagemaker), MLFlow, Git, Grafana
  • Huggingface, SBert, Pytorch, Pandas, Numpy, Plotly

Achievements

  • Developed the shot probability model deployed in Peacock Performance View watched by millions of NBA fans in the US
  • Built the football match commentary prototype, now deployed in Google timeline view
  • Successfully deployed a semantic search engine boasting 98% mapping accuracy for sports entities

Sep 2021 - Apr 2022 · Ljubljana, Slovenia

Key responsibilities
  • Developed ML models for real estate price estimation
  • Created and maintained model architecture documentation for external validation processes
  • Built a package for predicting energy efficiency of real estate

Technologies:

  • Python, SQL, Git
  • Django, Pandas, Numpy, Scikit-learn, Plotly

Achievements

  • Improved price estimation accuracy by 10%, leading to increased client satisfaction
  • Contributed to the data and model documentation which helped lock key clients

Oct 2019 - Sep 2021 · Ljubljana, Slovenia

Research Focus:

  • Developing an ontology for AI entities and processes
  • Using semantic technologies to annotate a data corpus (papers, algorithm documentation)
  • Fine-tuning Named entity recognition (NER) models for automatic semantic annotation

Key Contributions:

  • Developed tools for semantic annotation of AI algorithms
  • Collaborated on multiple research projects within the Knowledge Technologies department
  • Co-authored 2 research papers on semantic web technologies

Skills Developed:

  • Advanced machine learning techniques
  • Semantic web technologies (RDF, OWL, SPARQL)
  • Research methodology and scientific writing
  • Collaboration in an academic research environment

Skills

Python
  • Extensive experience with libraries like Pandas, Pytorch, Numpy, Polars, Scikit-learn, Plotly, Huggingface libraries
SQL
  • Proficient in writing complex queries and working with relational databases
FastAPI, Django
  • Web development experience building robust APIs
Docker, Kubernetes
  • Containerization and orchestration of ML models in production environments

Natural Language Processing (NLP)
  • LLMs, Named Entity Recognition (NER), Semantic Search, Text Classification
Machine Learning
  • Developing and deploying ML models for tasks such as shot outcome prediction, entity mapping and price estimation
Data Visualization
  • Using Plotly, Matplotlib for creating insightful visualizations to communicate findings

AWS
  • Experience with services such as Redshift, S3, Sagemaker for model training and data handling
MLFlow, Kubeflow
  • Managing machine learning lifecycles, model versioning, and experiment tracking
Git + CI/CD Pipelines

Version control, continuous integration and continuous deployment practices

Kafka

Real-time data processing

Education

International Postgraduate School Jožef Stefan

Faculty of Computer Science and Engineering - Ss. Cyril and Methodius University in Skopje

  • Sep 2015 - Sep 2019
  • Took courses in Discrete Mathematics, Calculus, Probability and Statistics, Computer Vision, Robotics, Network Science, Natural Language Processing, Computer Graphics, Bioinformatics
  • Laboratory assistant for the course Computer Animation
  • Published two research papers

Awards and Conferences

Best Documentation | HAMR Hackathon - ISMIR 2020

  • Analysis of Chord Progression Networks using data from Ultimate Guitar
  • Demo for generating chord progressions using graph traversal methods

Best student paper (3rd place) | CIIT 2019

  • Awarded for the paper The Geographic Flow Of Music On Spotify

Other Experiences

International Society for Music Information Retrieval (ISMIR)

Oct 2020

  • Assisted with online conference management
  • Moderated discussion panels
  • Gained experience in virtual event coordination

Key Skills: Event Management, Online Moderation, Music Information Retrieval

Faculty of Computer Science and Engineering

Jan 2018 - Jun 2018

  • Created comprehensive video tutorials for the Computer Animation course
  • Assisted students during laboratory exercises
  • Managed a Youtube channel with educational content
  • Achieved Certified Autodesk Student Expert status

Key Skills: 3D Animation, Teaching, Content Creation, Autodesk Software

Neotel

Apr 2018 - Jun 2018

  • Designed and manufactured drone parts using 3D printing
  • Assembled a functional drone
  • Experimented with photogrammetry software for 3D object generation

Key Skills: 3D Printing, Drone Assembly, Photogrammetry

Museum of the Macedonian Struggle Feb 2017 - Jun 2017

  • Created an interactive 3D virtual version of a museum exhibition room
  • Utilized Autodesk Maya, Unity, and WebGL for development
  • Published a research paper on the learning environment creation process
  • Contributed to innovative educational tools combining historical accuracy with user engagement

Key Skills: 3D Modeling, Game Development, Educational Technology, Research

Certifications

Note

These certifications demonstrate my commitment to continuous learning and expertise in key areas of data science and machine learning.

  1. Natural Language Processing Specialization | Coursera, January 2023
  • Comprehensive training in NLP techniques, from basic text processing to advanced deep learning models for sequence modeling.
  1. Machine Learning in Production | Coursera, November 2022
  • Advanced course on designing, deploying, and managing ML systems in production environments.
  1. Deep Learning Specialization | Coursera, July 2020
  • In-depth study of neural networks, covering everything from basics to advanced architectures like CNNs and RNNs.
  1. Cambridge English: Advanced (CAE) | Cambridge, June 2014
  • Certification of C1-level English proficiency, demonstrating advanced communication skills in professional and academic contexts.


Tip

For more information and to see my latest projects, please visit my GitHub profile

Want to get in touch? Contact me