Download Free Audio of Welcome to the lesson on exploratory data analysi... - Woord

Read Aloud the Text Content

This audio was created by Woord's Text to Speech service by content creators from all around the world.


Text Content or SSML code:

<speak> Welcome to the lesson on exploratory data analysis part-2. <break strength="strong"/>The next step in exploratory data analysis is the univariate analysis <break strength="weak"/>and the bivariate analysis. <break strength="x-strong"/> Next, <break strength="weak"/>we use univariate analysis and<break strength="weak"/> bivariate analysis for data analysis. <break strength="strong"/>You have seen detecting missing values and removing outliers. <break strength="x-strong"/> First, you will do the univariate analysis. <break strength="x-strong"/> What is a univariate analysis?<break strength="x-strong"/> Univariate analysis is the simplest form of analyzing data.<break strength="strong"/> Uni means one, <break strength="weak"/>so in other words, your data has only one variable. <break strength="x-strong"/> It does not deal with causes or relationships,<break strength="weak"/> and its purpose is to describe. <break strength="x-strong"/> It takes data, summarizes that data, <break strength="weak"/>and finds patterns in the data.<break strength="x-strong"/> In regression analysis, <break strength="weak"/>the data must be normally distributed (Gaussian distribution).  <break strength="x-strong"/> What is the normal distribution?<break strength="x-strong"/> Normally distributed data set when plotted<break strength="weak"/> must follow a bell-shaped symmetrical curve centered around the mean.<break strength="x-strong"/> It is better<break strength="weak"/> if the data has a gaussian distribution. <break strength="x-strong"/> Draw histogram using plot.hist method.<break strength="x-strong"/> You can see the chance of admission graph is a slightly skewed distribution towards the right. But still, <break strength="weak"/>it has some variation. <break strength="strong"/>It is varying from 0.4 to 0.9. <break strength="x-strong"/> You can observe enough variation in the chance of admit. <break strength="strong"/>This plot is a histogram plot. <break strength="x-strong"/> What is the histogram plot?<break strength="x-strong"/> A histogram is an approximate representation of the distribution of numerical data. <break strength="strong"/>For example, from 0.6 to 0.7, considered as a single bin like that. <break strength="x-strong"/> It gives the frequency of the values in the bin. <break strength="strong"/>If you see 80, <break strength="weak"/>there are 80 values in the bin corresponding to 0.7. <break strength="x-strong"/> Xlabel means we label the x-axis as the chances and its font size is 12.<break strength="strong"/> So we are plotting chances<break strength="x-strong"/> </speak>