STA301 Definitions and Formulas From Lec 1 to 22 | Virtual Study Solutions

Adsetra Ads

 

STA301 Definitions and Formulas

Here You can read and download STA301 - Statistics and Probability Definitions and Formulas From Lecture 01 to 22 for Preparation of Mid Term Exam. Previously we shared STA301 Midterm Subjective Past Papers.
STA301 - Statistics and Probability Definitions and Formulas From Lecture 01 to 22
STA301 - Statistics and Probability Definitions and Formulas From Lecture 01 to 22

Download STA301 Definitions and Formulas From Lec 01 to 22 

Definitions From Lecture 01 to 22 

Statistics:

Statistics is that science which enables to draw conclusions about various phenomena one the basis of real data collected on sample basis.

OR

Statistics is a science of facts and figures.


INFERENTIAL STATISTICS:

That branch of Statistics which enables us to draw conclusions or inferences about various phenomena on the basis of real data collected on sample basis.

Data:

A well defined collection of objects is known as data.

Qualitative data:

Data that are labels or names used to identify an attribute of each element. Qualitative data may be non numeric or numeric.

Qualitative variable:
A variable with qualitative data.

Quantitative data:

Data that indicate how much or how many of something. Quantitative data are always numeric.

Variable:
A measurable quantity which can vary from one individual or object to another is called a variable.

Nominal Scale:
The classification or grouping of observations into mutually exclusive qualitative categories is said to constitute a nominal scale e.g. students are classified as male and female.

Ordinal Scale:

It includes the characteristic of a nominal scale and in addition has the property of ordering or ranking of measurements e.g. the performance of students can be rated as excellent, good or poor.

Interval Scale:

A measurement scale possessing a constant interval size but not true zero point is called an Interval Scale.

Ratio Scale:

It is a special kind of an interval scale in which the scale of measurement has a true zero point as its origin.

Biased Errors:

An error is said to be biased when the observed value is consistently and constantly higher or lower than the true value.

Unbiased Errors or Random Errors:

An error, on the other hand, is said to be unbiased when the deviations, i.e. the excesses and defects, from the true value tend to occur equally often.

Primary Data:
The data published or used by an organization which originally collected them are called primary data thus the primary data are the first hand information collected, complied, and published by an organization for a certain purpose.

Secondary Data:
The data published or used by an organization other than the one which originally collected them are known as secondary data.


DIRECT PERSONAL INVESTIGATION: 

In this method, an investigator collects the information personally from the individuals concerned. Since he interviews the informants himself, the information collected is generally considered quite accurate and complete.

INDIRECT INVESTIGATION:

Sometimes the direct sources do not exist or the informants hesitate to respond for some reason or other. In such a case, third parties or witnesses having information are interviewed.

Population:

The collection of all individuals, items or data under consideration in statistical study is called Population.

Sample: 

A sample is a group of units selected from a larger group (the population). By studying the sample it is hoped to draw valid conclusions about the larger group.

OR

Sample is that part of the Population from which information is collected.

SAMPLING FRAME:

A sampling frame is a complete list of all the elements in the population.

Sampling Error:
The sampling error is the difference between the sample statistic and the population parameter.

Non-Sampling Error:

Such errors which are not attributable to sampling but arise in the process of data collection even if a complete count is carried out.

Sampling with replacement:

Once an element has been included in the sample, it is returned to the population. A previously selected element can be selected again and therefore may appear in the sample more than once.

Sampling without replacement:

Once an element has been included in the sample, it is removed from the population and cannot be selected a second time.

Standard error:

The standard deviation of a point estimator.

AND

The degree of scatter of the observed values about the regression line measured by what is called standard deviation of regression or standard error of estimate.

Sampling Unit:

The units selected for sampling. A sampling unit may include several elements.

NON-RANDOM SAMPLING:
Nonrandom sampling’ implies that kind of sampling in which the population units are drawn into the sample by using one’s personal judgment. This type of sampling is also known as purpose sampling.


Quota Sampling:

Quota sampling is a method of sampling widely used in opinion polling and market research. Interviewers are each given a quota of subjects of specified type to attempt to recruit for example, an interviewer might be told to go out and select 20 adult men and 20 adult women, 10 teenage girls and 10 teenage boys so that they could interview them about their television viewing.

RANDOM SAMPLING:
The theory of statistical sampling rests on the assumption that the selection of the sample units has been carried out in a random manner.

By random sampling we mean sampling that has been done by adopting the lottery method.

Simple random sampling:
Finite population: a sample selected such that each possible sample of size n has the same probability of being selected. Infinite population: a sample selected such that each element comes from the same population and the elements are selected independently.

Pie Chart:

Pie Chart consists of a circle which is divided into two or more mars in accordance with the number of distinct classes that we have in our data.

SIMPLE BAR CHART:
A simple bar chart consists of horizontal or vertical bars of equal width and lengths proportional to values they represent.

MULTIPLE BAR CHARTS:
This kind of a chart consists of a set of grouped bars, the lengths of which are proportionate to the values of our variables, and each of which is shaded or colored differently in order to aid identification.

CLASS BOUNDARIES:

The true class limits of a class are known as its class boundaries.

HISTOGRAM:

A histogram consists of a set of adjacent rectangles whose bases are marked off by class boundaries along the X-axis, and whose heights are proportional to the frequencies associated with the respective classes.

FREQUENCY POLYGON:

A frequency polygon is obtained by plotting the class frequencies against the mid-points of the classes, and connecting the points so obtained by straight line segments.

FREQUENCY CURVE:

When the frequency polygon is smoothed, we obtain what may be called the frequency curve.

CUMULATIVE FREQUENCY DISTRIBUTION:

As in the case of the frequency distribution of a discrete variable, if we start adding the frequencies of our frequency table column-wise, we obtain the column of cumulative frequencies.

AVERAGES (I.E. MEASURES OF CENTRAL TENDENCY):

A single value which intended to represent a distribution or a set of data as a whole is called an average. It is more or less a central value around which the observations tend to cluster so it is called measure of central tendency. Since measure of central tendency indicate the location of the distribution on X axis so it is also called measure of location.

The Arithmetic, Geometric and Harmonic means

Are averages that are mathematical in character, and give an indication of the magnitude of the observed values.

The Median

Indicates the middle position while the mode provides information about the most frequent value in the distribution or the set of data.

THE MODE:

The Mode is defined as that value which occurs most frequently in a set of data i.e. it indicates the most common result.

DOT PLOT:

The horizontal axis of a dot plot contains a scale for the quantitative variable that we want to represent. The numerical value of each measurement in the data set is located on the horizontal scale by a dot.

GROUPING ERROR:
“Grouping error” refers to the error that is introduced by the assumption that all the values falling in a class are equal to the mid-point of the class interval.

Ogive:

A graph of a cumulative distribution.

Dispersion:

The variability that exists between data set.

Range:

The range is defined as the difference between the maximum and minimum values of a data set.

The coefficient of variation:

The coefficient of variation expresses the standard deviation as the percentage of the arithmetic mean.

Quartiles:

Quartiles are those three quantities that divide the distribution into four equal parts.
Quartile Deviation:

The quartile deviation is defined as half of the difference between the first and third quartiles.

Quantiles:

Collectively the quartiles, the deciles, percentiles and other values obtained by equal sub-division of the data are called quantiles.

Percentiles:

Percentiles are those ninety nine quantities that divide the distribution into hundred equal parts

Absolute measure of dispersion:
An absolute measure of dispersion is one that measures the dispersion in terms of the same units or in the square of units, as the units of the data.

Relative measure of dispersion:


Relative measure of dispersion is one that is expressed in the form of a ratio, co-efficient of percentage and is independent of the units of measurement.

COEFFICIENT OF QUARTILE DEVIATION:
The Coefficient of Quartile Deviation is a pure number and is used for COMPARING the variation in two or more sets of data.

Mean Deviation:

The mean deviation is defined as the arithmetic mean of the deviations measured either from the mean or from the median, all deviations being counted as positive.

Variance:

Variance is defined as the square of the standard deviation.

Standard Deviation:

Standard Deviation is defined as the positive square root of the mean of the squared deviations of the values from their mean.

Five-number summary:

An exploratory data analysis technique that uses the following five numbers to summarize the data set: smallest value, first quartile, median, third quartile, and largest value.

Moments:

Moments are the arithmetic means of the powers to which the deviations are raised.

Correlation: 

Correlation is a measure of the strength or the degree of relationship between two random variables. OR Interdependence of two variables is called correlation. OR Correlation is a technique which measures the strength of association between two variables.

RANDOM EXPERIMENT:

An experiment which produces different results even though it is repeated a large number of times under essentially similar conditions is called a Random Experiment.

SAMPLE SPACE: 

A set consisting of all possible outcomes that can result from a random experiment (real or conceptual), can be defined as the sample space for the experiment and is denoted by the letter S. Each possible outcome is a member of the sample space, and is called a sample point in that space.

SIMPLE & COMPOUND EVENTS:
An event that contains exactly one sample point is defined as a simple event. A compound event contains more than one sample point, and is produced by the union of simple events.

MUTUALLY EXCLUSIVE EVENTS:
Two events A and B of a single experiment are said to be mutually exclusive or disjoint if and only if they cannot both occur at the same time i.e. they have no points in common.

EXHAUSTIVE EVENTS:

Events are said to be collectively exhaustive, when the union of mutually exclusive events is equal to the entire sample space S.

EQUALLY LIKELY EVENTS:

Two events A and B are said to be equally likely, when one event is as likely to occur as the other. In other words, each event should occur in equal number in repeated trials.

Skewness:

Skewness  is the lack of symmetry in a distribution around some central value (mean, median or mode).It is thus the degree of a symmetry.

Kurtosis:

Kurtosis is the degree of peakness of a distribution usually taken relative to a normal distribution.

MOMENTS:

A moment designates the power to which deviations are raised before averaging them.

Probability:

Probability is defined as the ratio of favorable cases over equally likely cases.

SUBJECTIVE OR PERSONALISTIC PROBABILITY:

Probability in this sense is purely subjective, and is based on whatever evidence is available to the individual. It has a disadvantage that two or more persons faced with the same evidence may arrive at different probabilities.

Conditional Probability:
The probability of the occurrence of an event A when it is known that some other event B has already occurred is called the conditional probability.

MULTIPLICATION LAW:

“The probability that two events A and B will both occur is equal to the probability that one of the events will occur multiplied by the conditional probability that the other event will occur given that the first event has already occurred.”

INDEPENDENT EVENTS:
Two events A and B in the same sample space S, are defined to be independent (or statistically independent) if the probability that one event occurs, is not affected by whether the other event has or has not occurred,

Baye’s theorem:
A method used to compute posterior probabilities.

RANDOM VARIABLE:

Such a numerical quantity whose value is determined by the outcome of a random experiment is called a random variable.

Discrete random variable:

A random variable that may assume either a finite number of values or an infinite sequence of values.

Set:

A set is any well-defined collection or list of distinct objects, e.g. a group of students, the books in a library, the integers between 1 and 100, all human beings on the earth, etc. 

SUBSET: 

A set that consists of some elements of another set, is called a subset of that set. For example, if B is a subset of A, then every member of set B is also a member of set A. If B is a subset of A, we write: (B ÃŒ A)


PROPER SUBSET:

If a set B contains some but not all of the elements of another set A, while A contains each element of B, then the set B is defined to be a proper subset of A.

Universal Set:
The original set of which all the sets we talk about, are subsets, is called the universal set (or the space) and is generally denoted by S or Ω.

VENN DIAGRAM:

A diagram that is understood to represent sets by circular regions, parts of circular regions or their complements with respect to a rectangle representing the space S is called a Venn diagram, named after the English logician John Venn (1834-1923).

The Venn diagrams are used to represent sets and subsets in a pictorial way and to verify the relationship among sets and subsets.

UNION OF SETS:

The union or sum of two sets A and B, denoted by (A È B) , and read as “A union B”, means the set of all elements that belong to at least one of the sets A and B, that is

(A È B)

INTERSECTION OF SETS:

The intersection of two sets A and B, denoted by A ∩B, and read as “A intersection B”, means that the set of all elements that belong to both A and B.

OPERATIONS ON SETS:

The basic operations are union, intersection, difference and complementation.

SET DIFFERENCE:

The difference of two sets A and B, denoted by A – B or by A – (A ∩ B), is the set of all elements of A which do not belong to B.

DISJOINT SETS:

Two sets A and B are defined to be disjoint or mutually exclusive or non-overlapping when they have no elements in common, i.e. when their intersection is an empty set i.e.

A Ç B = f

Conjoint Sets:

Two sets A and B are said to be conjoint when they have at least one element in common.

COMPLEMENTATION:
The particular difference S – A, that is, the set o f all those elements of S which do not belong to A, is called the complement of A and is denoted by A or by Ac.

POWER SET:

The class of ALL subsets of a set A is called the Power Set of A and is denoted by P(A). For example, if A = {H, T}, then P(A) = {φ, {H}, {T}, {H, T}}.

CARTESIAN PRODUCT OF SETS:

The Cartesian product of sets A and B, denoted by A × B, (read as “A cross B”), is a set that contains all ordered pairs (x, y), where x belongs to A and y belongs to B.

This set is also called the Cartesian set of A and B set of A and B, named after the French mathematician Rene’ Descartes (1596-1605).

TREE DIAGRAM:

The “tree” is constructed from the left to the righ t. A “tree diagram” is a useful device for enumerating all the possible outcomes of two or more sequential events.

PERMUTATION:

A permutation is any ordered subset from a set of n distinct objects.

OR

An arrangement of all or some of a set of objects in a definite order is called permutation. 

COMBINATION:

A combination is any subset of r objects, selected without regard to their order, from a set of n distinct objects.

OCCURRENCE OF AN EVENT:

An event A is said to occur if and only if the outcome of the experiment corresponds to some element of A.

PARTITION OF SETS:

A partition of a set S is a sub-division of the set into non-empty subsets that are disjoint and exhaustive, i.e. their union is the set S itself.

CLASS OF SETS:

A set of sets is called a class. For example, in a set of lines, each line is a set of points.

DISTRIBUTION FUNCTION:
The distribution function of a random variable X, denoted by F(x), is defined by F(x) = P(X < x).

STA301 – Statistics and Probability Mid Term Solved Papers

You can download STA301 Past Mid Term Solved Papers from the links below: 
If You liked This Post Please share with Your Friends. Thank You.

Post a Comment

 

Top