Skip to content
View Machoub's full-sized avatar

Highlights

  • Pro

Block or report Machoub

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Machoub/README.md

👤 Mehdi Adel Achouba (“Machoub”)

Welcome to my GitHub profile! I’m a 42 Paris student and developer with a background in finance, Devops, and programming. Below you’ll find a quick overview of who I am, what I’ve built, and where to find my work.


📖 About Me

Highly versatile professional with 5+ years in financial control and deep expertise in SQL/Excel reporting. I now build upon this foundation with proficiency in Machine Learning (Python/TensorFlow) for predictive analytics. My technical rigor extends to C++ and Docker for robust system design and deployment, creating a unique data-to-system profile.


🛠️ Skills & Technologies

  • Languages

    • Python (advanced – Data Science, ML, scripting)
    • C / C++ (algorithmic, system programming)
    • TypeScript / JavaScript (backend Fastify, web logic)
    • SQL (PostgreSQL, data analytics & financial reporting)
    • Bash / Shell scripting
  • Frameworks & Libraries

    • TensorFlow, PyTorch, Scikit-learn (machine learning & deep learning)
    • Pandas, NumPy, Matplotlib, PySpark (data analysis & ETL)
    • Fastify (Web)
  • Tools & Platforms

    • AWS (S3,Lambda,step,glue)
    • Snowflak (snowpipe)
    • Docker & Docker Compose (DevOps & containerization)
    • Kubernetes (K3s/K3d) & GitOps (Argo CD, GitLab CI/CD)
    • Git & GitHub (version control & collaboration)
    • PostgreSQL / PgAdmin (data storage & visualization)
    • Linux, GDB Debugger
    • Excel & Power BI (data visualization & KPIs)
  • Areas of Expertise

    • Data Engineering & Analytics: ETL pipelines, PostgreSQL, PySpark, Power BI
    • Machine Learning & AI: regression, classification, MLPs, model evaluation (Scikit-learn / TensorFlow)
    • DevOps: CI/CD pipelines, container orchestration (Docker, K3s)
    • Financial Analysis: automation, KPIs, SQL-based reporting
    • Software & Game Logic: Pong AI, matchmaking system, 2FA auth

📂 Highlighted 42 Projects

Each project has its own repository—click the links to explore code, build instructions, and detailed READMEs.

  • adzuna_spark_aws_snowflake_etl
    Building an End-to-End ETL Pipeline with AWS, Snowflake, Apache Spark using Adzuna public API

  • Bootcamp_neural_network
    A 3-day deep learning bootcamp at 42 where I learned to build and train CNNs and RNNs using TensorFlow and Keras. The program focused on image and sequence data, covering key architectures like convolutional and recurrent neural networks.

  • First_ETL_Spark
    This mini-project focuses on establishing a Spark Session using PySpark in Python to perform a basic distributed ETL pipeline. It demonstrates reading data, applying transformations (e.g., filtering, aggregation), and loading the results to a target location.

  • ft_linear_regression
    project from 42 School is an introductory machine learning assignment. It focuses on implementing univariate linear regression from scratch to predict numerical values based on a single input feature. Typically using the Gradient Descent algorithm to find the optimal model parameters.

  • DSLR
    A machine learning project that implements logistic regression from scratch to classify students into Hogwarts houses based on their features. It includes data analysis, visualization, and a modular one-vs-all classifier for multi-class prediction.

  • Piscine Data Science
    Intensive 42 bootcamp covering the full data pipeline:
    Data Engineer – Dockerized PostgreSQL + pgAdmin, fast ETL from CSVs directly in DB.
    Data Warehouse – Data merging, cleaning, deduplication, optimized SQL workflows.
    Data Analyst & Advanced DS – Statistical analysis, clustering, supervised/unsupervised ML, and data visualization.

  • Python_for_data_science Intro to data science using Python: numpy, pandas, datatable, matplotlib, and image processing with array. Includes OOP principles and Data-Oriented Design (DOD) for analysis and visualization projects.

  • Ft_transcendence Real-time Pong game with tournaments, Google OAuth2 authentication, and user profiles. Built with Fastify (Node.js), Vite (TypeScript), PostgreSQL, and Docker in a microservices architecture.

  • Ftl_quantum
    Introduction to quantum computing with Python and Qiskit (IBM). Covered quantum states, gates, and noise mitigation, and implemented key algorithms such as Deutsch–Jozsa, Simon, Grover, and Shor. Ran circuits on both simulators and real IBM Quantum devices.

  • Ft_IRC
    An IRC server developed in C++98, following the IRC Protocol RFC 1459, and designed to handle multiple simultaneous client connections via TCP/IP in non-blocking mode.

  • CPP_MODULES
    A single repository containing all C++ modules (00 through 09) from the 42 curriculum, covering basics, OOP, templates, exception handling, file I/O, design patterns, memory management, and concurrency.

  • Born2beroot
    System administration basics on Debian/VirtualBox. Encrypted LVM, UFW, strong password policies, strict sudo, and a custom monitoring.sh. Grade: 125%.

  • Libft
    Reimplementation of standard C library functions in C:

    1. Part 1 – Libc Functions (e.g., ft_strlen, ft_memcpy, ft_strchr, ft_atoi, etc.)
    2. Part 2 – Utility Functions (ft_substr, ft_strjoin, ft_split, etc.)
    3. Bonus – Linked List (ft_lstnew, ft_lstadd_back, ft_lstsize, etc.)
  • ft_printf
    Custom implementation of the standard printf function in C. Supports %c, %s, %p, %d, %i, %u, %x, %X, and %% with formatted output using only write.

  • Get Next Line
    Read one line at a time from a file descriptor. Handles any BUFFER_SIZE and supports multiple file descriptors in the bonus.

  • Pipex
    Reproduce UNIX piping: < infile cmd1 | cmd2 > outfile. Uses fork(), pipe(), dup2(), and $PATH.

  • Push_Swap
    Sorts a stack with the minimal number of allowed operations (sa, pb, ra, …). Focus on greedy algorithms and linked-list management.

  • So_Long
    A 2D collectible game using MiniLibX. Load a .ber map, collect items, and reach the exit. Handles keyboard input and real-time rendering.

  • Philosophers
    Dining Philosophers simulation with threads and mutexes. Avoids deadlock and starvation, enforces timing constraints.

  • Minishell
    A minimal shell with custom parsing, built-in commands (cd, echo, export, etc.), pipes, redirections, and signal handling.

  • Cub3D
    A Wolfenstein-style raycaster using MiniLibX. Parse a .cub file, render walls, floors, ceilings, and sprites in 3D.

  • NetPractice
    Browser-based network simulator: configure IPv4 addresses, subnets, gateways, and routes to ensure connectivity between hosts.

  • Inception
    Docker-based multi-container setup (Nginx, MariaDB, WordPress) as part of 42 curriculum’s Inception project.


💼 Professional Experience (Management Control)

During over 5 years in management control:

  • Built Excel models and dashboards to monitor KPIs.
  • Wrote and optimized SQL queries for data extraction, transformation, and reporting.
  • Automated repetitive finance tasks to improve accuracy and efficiency.

📈 GitHub Stats

Machoub’s GitHub stats


📬 Contact

Feel free to reach out if you have any questions or want to collaborate:

Thanks for visiting my profile!

Pinned Loading

  1. Python_for_data_science Python_for_data_science Public

    Python

  2. bootcamp_python bootcamp_python Public

    Forked from 42-AI/bootcamp_python

    Bootcamp to learn Python for Machine Learning

    TeX

  3. Machoub Machoub Public

    Config files for my GitHub profile.