alter-text

Lidija Jovanovska

Data Scientist | NLP & Machine Learning Specialist

About Me

I’m a data scientist specialising in building and deploying ML solutions. Since publishing my first research paper in 2019, I’ve been driven by a curiosity to solve complex problems and push the boundaries of data science and AI.

Currently, I’m part of the innovative team at Sportradar, where I focus on building NLP-driven solutions for entity mapping and semantic search. One of my proudest achievements has been deploying a semantic search engine that significantly improved mapping accuracy for various company services. I also get to have a little fun building GenAI products for sports commentary. Prior to this, I honed my skills at a real estate startup, developing machine learning models that boosted price estimation accuracy and strengthened client relationships.

I thrive on the intersection of innovation and impact. Whether it’s deploying machine learning models into production or fine-tuning language models for semantic annotation, I believe in the power of data to transform industries. I’m always eager to learn, explore new techniques, and bring creative solutions to the table. And if you ever want to chat AI, NLP, or how to make machines understand us a little better, feel free to reach out!

Skills

Python
  • Extensive experience with libraries like Pandas, Numpy, Scikit-learn, Pytorch, Plotly, Huggingface libraries
SQL
  • Proficient in writing complex queries and working with relational databases
FastAPI, Django
  • Web development experience building robust APIs
Docker, Kubernetes
  • Containerization and orchestration of ML models in production environments

Natural Language Processing (NLP)
  • Named Entity Recognition (NER), Semantic Search, Text Classification, LLMs
Machine Learning
  • Developing and deploying ML models for tasks such as entity mapping and price estimation
Data Visualization
  • Using Plotly, Matplotlib for creating insightful visualizations to communicate findings

AWS
  • Experience with services such as Redshift, S3, Sagemaker for model training and data handling
MLFlow, Kubeflow
  • Managing machine learning lifecycles, model versioning, and experiment tracking
Git + CI/CD Pipelines

Version control, continuous integration and continuous deployment practices

Kafka

Real-time data processing

Work Experience

Sep 2022 - Present · Ljubljana, Slovenia

Key responsibilities
  • Develop and implement NLP solutions for entity mapping and semantic search
  • Collaborate with cross-functional teams to integrate ML models into production systems
  • Build GenAI products for sport (e.g., match commentary)

Technologies

  • Python, Kubernetes, FastAPI, Docker, SQL, Kafka, AWS (Redshift, S3, Sagemaker), MLFlow, Kubeflow, Git
  • Huggingface, SBert, Pytorch, Pandas, Numpy, Plotly

Achievements

  • Successfully deployed a semantic search engine boasting 98% mapping accuracy
  • Developed a match commentary solution that will be integrated across different services

Sep 2021 - Apr 2022 · Ljubljana, Slovenia

Key responsibilities
  • Developed ML models for real estate price estimation
  • Created and maintained model architecture documentation for external validation processes
  • Built a package for predicting energy efficiency of real estate

Technologies:

  • Python, SQL, Git
  • Django, Pandas, Numpy, Scikit-learn, Plotly

Achievements

  • Improved price estimation accuracy by 10%, leading to increased client satisfaction
  • Contributed to the data and model documentation which helped lock key clients

Oct 2019 - Sep 2021 · Ljubljana, Slovenia

Research Focus:

  • Developing an ontology for AI entities and processes
  • Using semantic technologies to annotate a data corpus (papers, algorithm documentation)
  • Fine-tuning Named entity recognition (NER) models for automatic semantic annotation

Key Contributions:

  • Developed tools for semantic annotation of AI algorithms
  • Collaborated on multiple research projects within the Knowledge Technologies department
  • Co-authored 2 research papers on semantic web technologies

Skills Developed:

  • Advanced machine learning techniques
  • Semantic web technologies (RDF, OWL, SPARQL)
  • Research methodology and scientific writing
  • Collaboration in an academic research environment

Education

International Postgraduate School Jožef Stefan

Faculty of Computer Science and Engineering - Ss. Cyril and Methodius University in Skopje

  • Sep 2015 - Sep 2019
  • Took courses in Discrete Mathematics, Calculus, Probability and Statistics, Computer Vision, Robotics, Network Science, Natural Language Processing, Computer Graphics, Bioinformatics
  • Laboratory assistant for the course Computer Animation
  • Published two research papers

Awards and Conferences

Best Documentation | HAMR Hackathon - ISMIR 2020

  • Analysis of Chord Progression Networks using data from Ultimate Guitar
  • Demo for generating chord progressions using graph traversal methods

Best student paper (3rd place) | CIIT 2019

  • Awarded for the paper The Geographic Flow Of Music On Spotify

Other Experiences

International Society for Music Information Retrieval (ISMIR)

Oct 2020

  • Assisted with online conference management
  • Moderated discussion panels
  • Gained experience in virtual event coordination

Key Skills: Event Management, Online Moderation, Music Information Retrieval

Faculty of Computer Science and Engineering

Jan 2018 - Jun 2018

  • Created comprehensive video tutorials for the Computer Animation course
  • Assisted students during laboratory exercises
  • Managed a Youtube channel with educational content
  • Achieved Certified Autodesk Student Expert status

Key Skills: 3D Animation, Teaching, Content Creation, Autodesk Software

Neotel

Apr 2018 - Jun 2018

  • Designed and manufactured drone parts using 3D printing
  • Assembled a functional drone
  • Experimented with photogrammetry software for 3D object generation

Key Skills: 3D Printing, Drone Assembly, Photogrammetry

Museum of the Macedonian Struggle Feb 2017 - Jun 2017

  • Created an interactive 3D virtual version of a museum exhibition room
  • Utilized Autodesk Maya, Unity, and WebGL for development
  • Published a research paper on the learning environment creation process
  • Contributed to innovative educational tools combining historical accuracy with user engagement

Key Skills: 3D Modeling, Game Development, Educational Technology, Research

Certifications

Note

These certifications demonstrate my commitment to continuous learning and expertise in key areas of data science and machine learning.

  1. Natural Language Processing Specialization | Coursera, January 2023
  • Comprehensive training in NLP techniques, from basic text processing to advanced deep learning models for sequence modeling.
  1. Machine Learning in Production | Coursera, November 2022
  • Advanced course on designing, deploying, and managing ML systems in production environments.
  1. Deep Learning Specialization | Coursera, July 2020
  • In-depth study of neural networks, covering everything from basics to advanced architectures like CNNs and RNNs.
  1. Cambridge English: Advanced (CAE) | Cambridge, June 2014
  • Certification of C1-level English proficiency, demonstrating advanced communication skills in professional and academic contexts.


Tip

For more information and to see my latest projects, please visit my GitHub profile

Want to get in touch? Contact me