Skip to content
View OYanEnrique's full-sized avatar

Highlights

  • Pro

Block or report OYanEnrique

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
OYanEnrique/README.md

Oh! A wild Data Scientist appeared!

Not a Pokémon 😅, my name is Yan Enrique and I'm a Data Scientist from Brazil...

Yan Enrique | Data Scientist & Machine Learning Engineer

"Data is the new oil, but intelligence is the new refinery."

About me

✨ My Mission: From Pixels to Predictions

For 7 years, my world was defined by pixels. As a freelance Pixel Artist with over 600 completed projects and a 4.8-star rating from over 380 reviews, I learned a fundamental truth: a single misplaced pixel can ruin an entire composition. Today, I apply that exact same obsessive attention to detail to my daily routine as a Data Scientist.

Instead of placing colors on a canvas, my main brush is Python. My day-to-day involves getting my hands dirty with raw, unstructured data—cleaning and transforming it using Pandas and NumPy to ensure every 'pixel' of information is perfectly aligned. I craft visual narratives through Exploratory Data Analysis (EDA) using Matplotlib and Seaborn, and I engineer predictive models with Scikit-learn, XGBoost, and Tensorflow.

Whether I'm meticulously tuning hyperparameters, fine tuning a model, handling an outlier, processing natural language with NLTK/spaCy, or designing an intuitive dashboard, the bridge between my past and present remains the same: I build functional, precise, and visually compelling solutions.


🧠 My Tool Palette

  • Programming Languages: Python R SQL JavaScript Dart HTML5 CSS3
  • Data Manipulation & Analysis: Pandas NumPy PySpark SciPy Pingouin Statsmodel
  • Machine Learning & AI: Scikit-learn XGBoost PyTorch Hugging Face NLTK spaCy Scikit-image
  • Data Visualization & BI: Matplotlib Seaborn Tableau Looker Studio Streamlit Excel
  • Design & Workflow: Git Figma UX/UI Design Scrum Kanban Data Storytelling

📊 My Project Portfolio

This is where the magic happens. Explore my projects to see how I build bridges between questions and answers.

➡️ Access My Portifolio Page here ⬅️

Featured Machine Learning & Data Projects:

  • 📈 E-commerce Purchase Intent Prediction: End-to-End Data Science Project: From E-commerce Browsing Behavior Analysis to the Deployment of a Conversion-Focused Predictive Model.
  • 📈 Retrieval-Augmented Generation for Digital Humanities: A Python and Streamlit-based academic assistant designed to query and answer questions from a database of Digital Humanities articles and citations. Developed to organize my master's degree readings, the system leverages state-of-the-art LLMs for embeddings and text generation, ensuring accurate answers with proper source attribution.
  • 📈 Credit Card Clustering & Segmentation: Credit card customer segmentation using Unsupervised Machine Learning (K-Means), optimized via Optuna and interpreted with Explainable AI (SHAP).
  • 📈 Credit Card Customer Segmentation API: This API provides endpoints for real-time inference, classifying credit card customers into specific clusters (personas) based on their financial data. Additionally, it integrates with a Streamlit interface, allowing end-users to intuitively test the predictive model live on Hugging Face Spaces.
  • 📈 Bank Intent Classifier — Fine-tuning DistilBERT on Banking77: An end-to-end NLP project that classifies customer banking queries into fine-grained intent categories. The pipeline compares a classic machine learning baseline against a fine-tuned DistilBERT transformer, with all experiments systematically tracked using MLflow. For production deployment, the system is fully containerized with Docker, featuring a modular FastAPI backend for model inference and an interactive Streamlit web interface deployed live on Hugging Face Spaces.
  • 📈 Korean Bakery Sales: This project showcases a full-year sales analysis of a bakery, based on 10,840 cleaned and structured transactions. The main goal was to turn raw data into clear visual insights, using UX/UI design principles to improve readability.

Other Creative & Tech Projects:

  • 🌐 EconStudy: Interactive web platform for studying Economics subjects. Transforms lecture notes into a searchable knowledge base with 30+ structured concepts.
  • 🎮 Candy Mayhem: A turn-based text RPG where candy-themed creatures battle in a layer cake dungeon. Befriend them, fight rivals and explore the delicious universe.
  • 📄 UX Briefing: A simple command-line tool built with Python that integrates AI to helps you quickly create a structured project briefing by asking a series of key questions.
  • 📝 To-Do List: A command-line To-Do List application in Python that saves tasks to a text file.

📫 Let's Connect

Ready to transform your raw data into refined intelligence?

Let's build something memorable together.

My Projects are created with...

python logo R logo SQL logo javascript logo dart logo html5 logo css3 logo pandas logo numpy logo pyspark logo scipy logo scikit-learn logo pytorch logo hugging face logo matplotlib logo streamlit logo tableau logo looker studio logo excel logo git logo vscode logo figma logo antigravity logo statsmodel logo Pingouin logo

Snake animation

Pinned Loading

  1. google-stock-analysis google-stock-analysis Public

    Projeto de análise de dados end-to-end sobre o histórico de ações do Google (GOOG), utilizando Python (Pandas) para ETL e engenharia de features, e Looker Studio para a criação de um dashboard inte…

    Jupyter Notebook 1

  2. netflix-data-analysis netflix-data-analysis Public

    Análise exploratória de dados do catálogo da Netflix (2008-2021) utilizando Python (Pandas, Seaborn) e um dashboard interativo no Looker Studio.

    Jupyter Notebook 1

  3. korean_bakery_sales korean_bakery_sales Public

    [PT] Análise de vendas de uma padaria com foco em UX/UI, visualizações de dados e insights acionáveis a partir de 10.840 transações. [EN] Bakery sales analysis with a UX/UI approach, data visualiza…

    Jupyter Notebook 1

  4. ice-cream-revenue-prediction ice-cream-revenue-prediction Public

    A data analysis project that uses Linear Regression to predict ice cream revenue based on temperature. Includes a Jupyter/Google Colab Notebook, dataset, and a Looker Studio dashboard.

    Jupyter Notebook 1

  5. econstudy-ufrrj econstudy-ufrrj Public

    📊 Plataforma web interativa para estudo de História Econômica Geral e Macroeconomia 1 (UFRRJ). Transforma anotações de aula em base de conhecimento pesquisável com 30+ conceitos estruturados.

    CSS 1

  6. google-capstone-cyclistic-conversion google-capstone-cyclistic-conversion Public

    Projeto Capstone Google Data Analytics | Estudo de caso real usando R para identificar padrões comportamentais entre 600k+ viagens de bike-share, segmentar usuários casuais vs membros anuais, e ger…

    R 1