Skip to content
View SasySpanish's full-sized avatar

Block or report SasySpanish

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
SasySpanish/README.md

Header wave

MSC in Statistics

Quantitative AI analysis in Risk, Finance, Insurance.

Currently workin on Agentic deployment, Big Data and Cloud

Featured Projects

AI & Automation

Big Data

Data Science & Machine Learning

Data Analysis & Real-World Insights

Highlights

AI Agents & Automation

  • Designed and deployed LLM-powered agents (Groq LLaMA 70B & Ollama Qwen 2.5) for automated financial and revenue analysis
  • Built a Streamlit web app with interactive Plotly dashboards for real-time user-driven analysis
  • Implemented full pipeline: data ingestion (Yahoo Finance) → financial metrics (17+ KPIs) → multi-year trend analysis & CAGR → automated reporting
  • Enabled scalable analysis across 100+ companies and 10+ sectors via prompt-based interaction

Big Data Engineering & Analytics

  • Engineered a PySpark pipeline on Databricks to process large-scale NOAA weather data from AWS S3
  • Implemented data lake architecture (Delta Lake + Parquet) for efficient storage and querying
  • Computed multi-level aggregations (continent, country, city) using Spark & Pandas
  • Delivered insights through scalable visualizations (Matplotlib, Seaborn)

Quantitative Finance & Risk Modeling

  • Developed Value-at-Risk models on Brent Oil using econometric and ML approaches with backtesting validation
  • Demonstrated superior performance of ML models (Boosting) in capturing nonlinear risk dynamics and reducing forecast errors
  • Applied Monte Carlo simulations and classification models in insurance use cases
  • Built customer risk segmentation (PCA + KMeans) identifying underpriced high-risk clusters to support underwriting decisions

Machine Learning & Deep Learning

  • Trained a ResNet-18 (transfer learning) model in PyTorch for image classification
  • Achieved ~98% validation accuracy on a 20k image dataset
  • Evaluated model robustness using confusion matrix and ROC curve

Applied Data Analysis & Automation

  • Built an end-to-end financial data pipeline (yFinance → cleaning → EDA → feature engineering → visualization → reporting)
  • Reduced analysis time by ~50% through automation and reusable workflows
  • Conducted CO₂ emissions and global life expectancy analysis extracting macro-level insights
  • Performed geospatial poverty analysis in Italy (R) using mapping, spatial visualization and regression models to uncover territorial disparities

Footer wave

Pinned Loading

  1. Insurance_Life_Policies Insurance_Life_Policies Public

    Actuarial Simulator for insurance life policies using the official ANIA SIM/SIF 2022 mortality tables (Standard Italian Mortality Tables). It supports both classic and advanced multi-component poli…

    Python

  2. Insurance-Customer-Risk-Segmentation-with-Python Insurance-Customer-Risk-Segmentation-with-Python Public

    This repository contains a complete, end-to-end data pipeline for insurance customer risk segmentation and pricing optimization using synthetic data with python. It includes raw customer data, a re…

    Python

  3. AI-Agent-for-Revenue-Analysis-with-Groq-on-StreamLit AI-Agent-for-Revenue-Analysis-with-Groq-on-StreamLit Public

    A web-based financial analysis application powered by llama-3.3-70b-versatile via Groq, built with Streamlit. Enter a company name, sector keyword, or list of ticker symbols, and the agent automati…

    HTML

  4. NOAA-GSOD-Global-Weather-Analysis-with-PySpark-on-Databricks NOAA-GSOD-Global-Weather-Analysis-with-PySpark-on-Databricks Public

    Cloud-friendly data analysis of global weather data (NOAA GSOD) with PySpark on Databricks, storing in Data Lake — temperature trends, precipitation extremes, heatwaves, anomalies, interactive Plot…

    Jupyter Notebook

  5. AI-Agent-for-Financial-Analysis-Automation-with-Python AI-Agent-for-Financial-Analysis-Automation-with-Python Public

    AI-powered agent made with Python, yFinance, Pandas, Matplotlib, and Ollama (Qwen2.5 model) for automated financial data processing. Handles user prompts to download/clean data, compute technical i…

    HTML

  6. Brent-Oil-VaR-using-Machine-Learning-with-Python Brent-Oil-VaR-using-Machine-Learning-with-Python Public

    This repository contains the analysis and results of a study on the estimation of Value at Risk (VaR) for Brent crude oil. The analysis compares classical VaR methods with advanced Machine Learning…

    Python