Languages
Python, Java, R, C/C++, PL/SQL
IDE
Anaconda, Eclipse, VS Code, RStudio
Hello I'm
Kpodjro
Data Scientist/ Machine Learning Engineer
Machine Learning Engineer with two experiences developing and optimizing LLM models.
Expertise in data mining, managing question-answering systems, and fine-tuning language models,
having improved prediction scores by 23% during a recent internship. Able to transform complex
analyses into practical solutions.
Ready to bring NLP and machine learning skills to meet business needs.
I am currently pursuing my master's degree in Data Science with a specialization in machine
learning algorithm development, data analysis and decision support system development at Paris
Cité University in Paris.
ATTIJARIWAFA BANK
March 2024 - August 2024
Technologies :
Mistral 7b, Langchain, torch, streamlit, VSCode, HuggingFace, MongoDB(NoSQL), Jira
MLView Consulting
August 2023 - September 2023
Technologies :
Langchain, HuggingFace, LLAMA-2, tensorflow, VSCode, Colab, Git&Github, SQL
Since September 2024 :
MSc's degree in Machine Learning for Data Science, Paris Cité University, Paris France Major field program2021 - 2024 :
Engineering degree in Software and intelligent Systems, Abdelmalek Essaadi University, Tangier, Morocco Major field program2019 - 2021 :
Associate's Degree in Mathematics,Computer Science and physics, Hassan 1st University, Settat Morocco Major field program
Discover my
Research project (in a team of 4 students) :
10/2024 - 05/2025
Abstract : Enhanced movie recommendations using LLMs (Gemini-1.5, Mistral) to enrich user/item profiles, significantly improving accuracy in LightGCN, MLP, and Matrix Factorization by addressing data sparsity and enabling nuanced personalization. Focused on responsible integration, acknowledging challenges like bias and cost.
Main tasks :
Research project (in a team of 4 students) :
01/2025 - 02/2025
Abstract: This paper explores the use of TreeTagger to accurately identify the different functions of the word "that" in English, such as conjunction, relative pronoun, determiner, or adverb. We first evaluate pre-trained models from the BNC and Penn corpora on a test dataset, then re-train TreeTagger with specific labels derived from the Brown corpus to enhance accuracy. Comparisons with other tools like Stanza and UDpipe are also presented. The main findings demonstrate that re-training with the Brown corpus significantly improves the tool’s performance and ability to distinguish among the various uses of "that".
Main Tasks:
Explore My
Python, Java, R, C/C++, PL/SQL
Anaconda, Eclipse, VS Code, RStudio
Pandas, Numpy, statsmodels, sklearn, Pyspark
sklearn, TensorFlow, Keras, pytorch
Supervised,
Unsupervised, Reinforcement, Ensemble Learning
CNN, RNN, LSTM, ANN, TensorFlow, Keras,GNN
OpenCV, Tesseract, KerasCV, pillow
Bert, KerasNLP, LLMAMA-2, Mistral
Statistical Modeling, Dashboard Development
MySQL, PostgreSQL, MongoDB, Oracle, Hive
HTML5 & CSS3, Streamlit, Flask, FastAPI, Shiny, Angular (Beginner)
scrapy, BeautifulSoup, Selenium, pytrend
Git & GitHub
Docker
Airflow, cron(Linux), AWS, GCP, Vertex AI
Gantt Project, Jira
Scientific Document Preparation
Data Visualization, Dashboard Development
Browse my
Sklearn, tensorflow, Keras, matplotlib,SQL
sklearn, seaborn, xgboost, lightgbm
Kafka Stream, PySpark, Sklearn, Flask, Angular, Docker,SQL
Sklearn, OpenCV, flask
Langchain, Streamlit, FAISS, LLMAMA-2
My
Date of issue: 09/2022
Organism : Huawei
Date of issue: 10/2023
Organism : OpenCV University
Date of issue: 10/2023
Organism : Nasa Space Challenge
Date of issue: 09/2023
Organism : Kaggle
Date of issue: 10/2023
Organism : Kaggle
Get in touch
Copyright © 2025 Kpodjro KPATOUKPA. All Rights Reserved.