Choose category:


How to Write Interactive Python Scripts
Examples of how you can write scripts taking input from users and/or standard input Functions With Users Input In Python Below we


Document Similarity with Synset and Path Similarity
We will create the following functions: convert_tag: converts the tag given by nltk.pos_tag to a tag used by wordnet.synsets. You will need to use


Example of Machine Translation in Python and Tensorflow
We will build a deep neural network that functions as part of an end-to-end machine translation pipeline. The completed pipeline


Time Series Decomposition in Python
Time series decomposition is a technique that splits a time series into several components, each representing an underlying pattern category, trend,


Tutorial: Control Flow in Bash Scripting
In this tutorial we will provide basic examples of the conditional statements in bash scripting. Passing Variables to Scripts via


Pandas Dataframes Basics: Reshaping Data
In this series of posts, we will show you the basics of Pandas Dataframes which is one of the most


Anna Karenina Principle in Data Science
Anna Karenina novel begins with the quote: All happy families are alike; each unhappy family is unhappy in its own


10 Tips And Tricks For Data Scientists Vol.5
We have started a series of articles on tips and tricks for data scientists (mainly in Python and R). In


How to Run your first Airflow DAG in Docker
We have how to start running Airflow in Docker, in this post we will provide an example of how you


How to Resample Data by Group In Pandas
Sometimes when we are working on machine learning projects, there are some factors that can have a huge impact on


How to Start Running Apache Airflow in Docker
The simplest and fastest way to start Airflow is to run it with CeleryExecutor in Docker. We assume that you have


10 Tips and Tricks for Data Scientists Vol.4
We have started a series of articles on tips and tricks for data scientists (mainly in Python and R). In


How to Predict Runners’ Place in a Race
There are many different approaches to predict the winner of a race. The race can be any distance and the


10 Tips and Tricks for Data Scientists Vol.3
I have started a series of articles on tips and tricks for data scientists (mainly in Python and R). In


Redact Name Entities with SpaCy
When we work on NLP projects, we need to do text mining and data cleansing. A common task is to


How to Query S3 Objects with S3 Select
When we are dealing with CSV data on S3, it is usual to want to do some quick checks by


10 Tips and Tricks for Data Scientists Vol.2
We have started a series of articles on tips and tricks for data scientists (mainly in Python and R). In


10 Tips and Tricks for Data Scientists Vol.1
Introduction As data scientists, we love to do our job efficiently without reinventing the wheel. Tips-and-tricks articles provide snippets of


How to deploy multiple containers using docker-compose and Amazon Elastic Beanstalk’s multi container
Introduction We will provide an example of how you can work with multiple containers, how to share volumes and how


How to Redact PII Data using AWS Comprehend
Personal data, also known as personally identifiable information (PII) is any information relating to an identifiable person. Some examples of


How to Deploy an App to AWS using Elastic Beanstalk with Dockers
Assume that we work on AWS and we have built a Flask API and we want to deploy it. Let’s