This code demonstrates how to load datasets into PySpark and perform simple data transformations. It builds a sample dataset with PySpark's built-in APIs, or reads data from an external source, and converts it into a PySpark DataFrame for distributed processing and manipulation.
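Below is a minimal sketch of that workflow. The column names and in-memory sample data are illustrative assumptions; in a real pipeline the DataFrame would typically come from `spark.read.csv(...)`, `spark.read.parquet(...)`, or a JDBC source.

```python
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

spark = SparkSession.builder.appName("pyspark-intro").getOrCreate()

# Build a DataFrame from an in-memory list (illustrative data only).
data = [
    ("Alice", "Engineering", 85000),
    ("Bob", "Marketing", 62000),
    ("Carol", "Engineering", 91000),
]
df = spark.createDataFrame(data, ["name", "department", "salary"])

# Simple transformations: filter rows, derive a column, and aggregate.
engineers = df.filter(F.col("department") == "Engineering")
with_bonus = engineers.withColumn("salary_with_bonus", F.col("salary") * 1.1)
avg_by_dept = df.groupBy("department").agg(F.avg("salary").alias("avg_salary"))

avg_by_dept.show()
spark.stop()
```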
The broader goal is to generate a synthetic dataset of one million employee records for a fictional company, load it into a PostgreSQL database, build analytical reports with PySpark's large-scale data analysis capabilities, and train machine learning models that predict monthly and yearly hiring and layoff trends, as sketched in the examples below.
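The first sketch covers data generation and the PostgreSQL load. The connection URL, credentials, table name, and column choices are assumptions for illustration, and the PostgreSQL JDBC driver must be available to Spark (here pulled in via `spark.jars.packages`).

```python
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

spark = (
    SparkSession.builder
    .appName("employee-data-generator")
    .config("spark.jars.packages", "org.postgresql:postgresql:42.7.3")
    .getOrCreate()
)

# Generate 1,000,000 rows with pseudo-random employee attributes.
employees = (
    spark.range(0, 1_000_000)
    .withColumnRenamed("id", "employee_id")
    .withColumn(
        "department",
        F.expr(
            "array('Engineering','Sales','Marketing','HR','Finance')"
            "[cast(employee_id % 5 AS int)]"
        ),
    )
    .withColumn("salary", (F.rand(seed=42) * 90000 + 40000).cast("int"))
    .withColumn(
        "hire_date",
        F.expr("date_sub(current_date(), cast(rand(7) * 3650 AS int))"),
    )
    .withColumn("terminated", F.rand(seed=11) < 0.15)
)

# Write to PostgreSQL (hypothetical database "hr", table "employees").
(
    employees.write
    .format("jdbc")
    .option("url", "jdbc:postgresql://localhost:5432/hr")
    .option("dbtable", "employees")
    .option("user", "hr_user")
    .option("password", "hr_password")
    .mode("overwrite")
    .save()
)
```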
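The second sketch shows one possible analytical report (hires per month) and a simple Spark MLlib regression fit on the resulting time series to estimate a hiring trend. Table, column, and connection names mirror the generator sketch above and are assumptions rather than a fixed schema; a production model for hiring and layoff forecasts would need richer features and proper train/test evaluation.

```python
from pyspark.sql import SparkSession
from pyspark.sql import functions as F
from pyspark.ml.feature import VectorAssembler
from pyspark.ml.regression import LinearRegression

spark = SparkSession.builder.appName("hiring-trends").getOrCreate()

# Read the synthetic table back from PostgreSQL (same hypothetical settings).
employees = (
    spark.read.format("jdbc")
    .option("url", "jdbc:postgresql://localhost:5432/hr")
    .option("dbtable", "employees")
    .option("user", "hr_user")
    .option("password", "hr_password")
    .load()
)

# Analytical report: number of hires per calendar month.
monthly_hires = (
    employees
    .withColumn("hire_month", F.date_trunc("month", F.col("hire_date")))
    .groupBy("hire_month")
    .agg(F.count("*").alias("hires"))
    .orderBy("hire_month")
)
monthly_hires.show(12)

# Convert each month to a numeric index and fit a linear trend model.
training = (
    monthly_hires
    .withColumn(
        "month_index",
        F.months_between(F.col("hire_month"), F.to_date(F.lit("2015-01-01"))),
    )
    .withColumn("hires", F.col("hires").cast("double"))
)
assembler = VectorAssembler(inputCols=["month_index"], outputCol="features")
model = LinearRegression(featuresCol="features", labelCol="hires").fit(
    assembler.transform(training)
)
print("Estimated monthly hiring trend (slope):", model.coefficients[0])
```

The same pattern applies to layoffs: aggregate terminations by month (or year) using the `terminated` flag and a termination date, then fit the regression on that series instead.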