

Arrays in Bash
This tutorial is a brief introduction to arrays in Bash. Data scientists are familiar with arrays; in R, a simple


Uncovering Customer Purchasing Patterns with Market Basket Analysis: A Python-Based Tutorial for Shopify
Market basket analysis is important for an e-shop because it can provide insights into customer purchasing behavior and help identify


10 Tips And Tricks For Data Scientists Vol.11
We have started a series of articles on tips and tricks for data scientists (mainly in Python and R). In case you have


How to Get Data from Snowflake using Python
In a previous post, we provided an example of how to load data from S3 to Snowflake. Data Scientists and


How to Load Data from S3 to Snowflake
In this tutorial, we will show you how to create a database and a warehouse and how to load data


Automated Machine Learning Model Testing
We have all been in the situation where we didn’t know which model is optimal for our ML project and


How to create tables and query data in Redshift Spectrum from S3
In this tutorial, we will show you how to create several tables in Redshift Spectrum from data stored in S3.


How to Create an EMR Cluster
Let’s see how to create an EMR cluster on AWS. Assuming that you have the required access (IAM roles), we


An Example of a Data Science Pipeline in Python on Bike Sharing Dataset
Introduction: We will provide a walk-through tutorial of the “Data Science Pipeline” that can be used as a guide for


Meta Analysis in R
Introduction: We can analyze different scientific studies that address the same question by applying a meta-analysis. The assumption is that


How to Train a BERT Model with SageMaker
In this tutorial, we will provide an example of how we can train a model for an NLP classification problem with BERT and