Numerical Summary Part-Ⅱ: Familiarizing Measures of Spread

In statistics, dispersion is the extent to which a distribution is stretched or squeezed. Common examples of measures of statistical dispersion are the variance, standard deviation, and interquartile range.
Published on May 08, 2021 | 17 min read
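
As a rough illustration of the three measures named in the excerpt above, the following minimal sketch computes them with Python's standard `statistics` module. The sample values are hypothetical, and it assumes the data is treated as a whole population rather than a sample; the post itself may use different data, tools, or conventions.

```python
import statistics

# Hypothetical values, chosen only for illustration.
data = [2, 4, 4, 4, 5, 5, 7, 9]

# Population variance: mean of squared deviations from the mean.
variance = statistics.pvariance(data)

# Population standard deviation: square root of the variance.
std_dev = statistics.pstdev(data)

# Quartile cut points (default "exclusive" method); IQR = Q3 - Q1.
q1, _median, q3 = statistics.quantiles(data, n=4)
iqr = q3 - q1

print(f"variance={variance}, std_dev={std_dev}, iqr={iqr}")
```

For sample (rather than population) estimates, `statistics.variance` and `statistics.stdev` divide by n - 1 instead of n.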

Sampling from a Population and its influence on its Estimates

A data distribution is a function or a listing which shows all the possible values (or intervals) of the data. It also (and this is important) tells you how often each value occurs. Often, the data in a distribution will be ordered from smallest to largest, and graphs and charts allow you to easi...
Published on May 01, 2021 | 4 min read

21 Questions you should ask before starting a Data-Driven Project

A data-driven approach enables companies to examine and organise their data with the goal of better serving their customers and consumers. By using data to drive its actions, an organisation can contextualise and/or personalise its messaging to its prospects and customers for a more customer-cent...
Published on April 24, 2021 | 5 min read

Numerical Summary Part-Ⅰ: Understanding Measures of Central Tendency

In descriptive statistics, summary statistics are used to summarize a set of observations, in order to communicate the largest amount of information as simply as possible.
Published on April 17, 2021 | 7 min read

2 Ways to Reduce Inflection in Text Data

Text normalization is the process of transforming text into a single canonical form that it might not have had before. Normalizing text before storing or processing it allows for separation of concerns, since input is guaranteed to be consistent before operations are performed on it.
Published on April 10, 2021 | 11 min read
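
To make the idea of a "single canonical form" in the excerpt above concrete, here is a minimal sketch of a text normalizer using only the Python standard library. The function name `normalize_text` and the chosen steps (Unicode NFKC, case folding, whitespace collapsing) are assumptions for illustration; they are not necessarily the two inflection-reduction methods the post covers.

```python
import re
import unicodedata


def normalize_text(text: str) -> str:
    """Return one canonical form of `text` (illustrative steps only)."""
    # Fold compatibility characters (e.g. ligatures) into their plain equivalents.
    text = unicodedata.normalize("NFKC", text)
    # Case-fold so that "WORLD" and "world" compare equal.
    text = text.casefold()
    # Collapse runs of whitespace into single spaces and trim the ends.
    return re.sub(r"\s+", " ", text).strip()


print(normalize_text("  Hello\tWORLD  "))
```

Because every input passes through the same function before storage or comparison, downstream code can assume consistent input, which is the separation of concerns the excerpt describes.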

Working with Multi-Panel-Plots in R

ggplot2 is a data visualization package for the statistical programming language R. Created by Hadley Wickham in 2005, ggplot2 is an implementation of Leland Wilkinson's Grammar of Graphics—a general scheme for data visualization which breaks up graphs into semantic components such as scales and ...
Published on April 03, 2021 | 3 min read

Getting access to Social Media APIs

An Application Programming Interface (API) is an intermediary that enables data exchange between a service and its users. To access data from an API, one needs to authenticate, which is the topic of this markdown.
Published on March 27, 2021 | 5 min read

Extracting data from GitHub search API

The Search API is an XQuery library that combines searching, search parsing, search grammar, faceting, snippeting, search term completion, and other search application features into a single API. You can interact with the Search API through XQuery, REST, Node.
Published on March 20, 2021 | 4 min read