Implemented an end-to-end data pipeline with Spark on Amazon EMR that processes TB-scale data.
Experienced in extract, transform, and load (ETL) processing of large datasets in different formats with AWS Glue.