5000, 5000, 6000, 10000, 11000, 12000, 13000, 15000, 17000, 20000, 21000, 25000, 26000, 30000, 32000, 35000, 37000, 40000, 41000, 43000, 45000, 47000, 50000

Solution

Value of Sales in Numbers       Number of observations (frequency)
5000 but less than 10000        3
10000 but less than 15000       4
15000 but less than 20000       2
20000 but less than 25000       2
25000 but less than 30000       2
30000 but less than 35000       2
35000 but less than 40000       2
40000 but less than 45000       3
45000 but less than 50000       2
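The tally in the solution can be reproduced with a short script. This is a minimal sketch, assuming the "but less than" wording means each class includes its lower limit and excludes its upper limit; the function name frequency_table and its parameters are illustrative and do not come from the text.

```python
# Sales values as listed in the text (thousands separators removed).
sales = [5000, 5000, 6000, 10000, 11000, 12000, 13000, 15000, 17000,
         20000, 21000, 25000, 26000, 30000, 32000, 35000, 37000,
         40000, 41000, 43000, 45000, 47000, 50000]


def frequency_table(values, start, stop, width):
    """Count values in half-open classes [lower, lower + width).

    Hypothetical helper: values below `start` or at/above `stop`
    are returned separately instead of being forced into a class.
    """
    counts = {lower: 0 for lower in range(start, stop, width)}
    outside = []
    for v in values:
        if start <= v < stop:
            # Lower limit of the class that contains v.
            lower = start + ((v - start) // width) * width
            counts[lower] += 1
        else:
            outside.append(v)
    return counts, outside


counts, outside = frequency_table(sales, start=5000, stop=50000, width=5000)
for lower, n in counts.items():
    print(f"{lower} but less than {lower + 5000}: {n}")
print("Not in any class:", outside)
```

Under this reading of the class limits, the observation 50000 sits exactly on the upper limit of the last class and so is not counted in any class, which is why the frequencies sum to one fewer than the number of observations listed.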
One example of the analysis of relationships is examining how variables relate to one another using statistical methods. An analysis may show that as a person's income increases, their spending also increases. If the data show that such a relationship exists, a statistical model can be developed to describe the relationship between income and spending and to test whether it is statistically significant, that is, whether it is unlikely to have occurred by chance. The general approach to statistical inference is to develop and test a hypothesis about the relationship. The hypothesis may concern, for example, the general pattern of a variable over time, observations collected from a sample of individuals, or income and spending patterns in different countries.

If you are interested in the relationship between the closing price of the S&P 500 and the trading volume, you would examine both sets of data. First, you need to determine whether higher trading volume is associated with higher closing prices. A scatter plot is a useful tool for this analysis; here the closing price is the dependent (response) variable and trading volume is the independent variable. The most common method is regression, which allows you to model the relationship in the data and then test hypotheses about its strength. A linear regression model is commonly used for this: it fits a best-fit line through the two sets of data and then measures how far the data points lie, on average, from that line. Note that you can include multiple independent variables in your analysis, for example both trading volume and market sentiment in predicting closing prices. When more than one independent variable is included, the method is referred to as multiple regression analysis, one of the most commonly used techniques in statistics.
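The best-fit line described above can be sketched in a few lines of Python. This is only an illustration: the volume and closing-price numbers are made up for the example and are not real S&P 500 data, and the helper names fit_line and r_squared are assumptions. The sketch fits the line and measures how well it fits; it does not carry out a significance test.

```python
def fit_line(x, y):
    """Least-squares slope and intercept for y = intercept + slope * x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def r_squared(x, y, slope, intercept):
    """Share of the variation in y that the fitted line accounts for."""
    mean_y = sum(y) / len(y)
    ss_res = sum((yi - (intercept + slope * xi)) ** 2 for xi, yi in zip(x, y))
    ss_tot = sum((yi - mean_y) ** 2 for yi in y)
    return 1 - ss_res / ss_tot


# Made-up illustrative numbers, NOT real market data.
volume = [1.0, 2.0, 3.0, 4.0, 5.0]            # independent variable
close = [102.0, 104.5, 105.5, 108.5, 109.5]   # dependent (response) variable

slope, intercept = fit_line(volume, close)
fit = r_squared(volume, close, slope, intercept)
print(f"close = {intercept:.2f} + {slope:.2f} * volume, R^2 = {fit:.3f}")
```

A positive slope here would correspond to higher volume being associated with higher closing prices; whether such an association is statistically significant needs a separate test, which statistical packages report alongside the fitted coefficients.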
Multiple regression models allow you to examine the relationship between variables while controlling for the effects of other variables. The most common method is ordinary least squares (OLS) regression, which can be applied to various types of cross-sectional or time series data. If you are interested in a binary (yes/no) outcome, for example whether you are promoted in a job (yes or no) based on your performance, you can use a logistic regression or a probit model. Analysis is often conducted using statistical software packages designed for these methods, such as Stata, SPSS, or R. These packages can also perform more complex analyses to test whether the relationships identified by these methods are simply the result of chance. Analysis often addresses not just the strength of relationships but also whether they match theoretical expectations. It is important that the data are appropriate for the statistical model being used, and judging this includes drawing on your theoretical understanding of the relationships.
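To make "controlling for other variables" concrete, the following sketch fits an OLS model with two predictors by solving the normal equations directly. It is a teaching illustration under stated assumptions, not how Stata, SPSS, or R implement regression: the helpers solve and ols are hypothetical names, and the data are constructed so that the response is exactly 100 + 2 * volume + 3 * sentiment, which makes the recovered coefficients easy to check.

```python
def solve(a, b):
    """Solve the linear system a x = b by Gaussian elimination with pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]  # augmented matrix
    for col in range(n):
        # Swap in the row with the largest entry in this column.
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):  # back substitution
        known = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - known) / m[i][i]
    return x


def ols(rows, y):
    """Return [intercept, b1, b2, ...] minimising the sum of squared errors."""
    X = [[1.0] + list(r) for r in rows]  # prepend a constant term
    k = len(X[0])
    xtx = [[sum(row[i] * row[j] for row in X) for j in range(k)]
           for i in range(k)]
    xty = [sum(row[i] * yi for row, yi in zip(X, y)) for i in range(k)]
    return solve(xtx, xty)


# Constructed example data (not real): (volume, sentiment) pairs.
predictors = [(1, 0), (2, 1), (3, 0), (4, 1), (5, 0), (6, 2)]
close = [100 + 2 * v + 3 * s for v, s in predictors]

coefficients = ols(predictors, close)
print("intercept, volume, sentiment:", [round(c, 6) for c in coefficients])
```

The coefficient on volume is the estimated change in the response for a one-unit change in volume with sentiment held fixed, which is the sense in which a multiple regression controls for the other variable. For real work a statistical package is preferable, since it also reports standard errors and significance tests.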
Matt Feiszli, Research Manager, Facebook
Day 1, 10:00 - 11:30am
Video Understanding: Time, and Scale (Slides)
I will discuss the analysis of video understanding, particularly its applications at Facebook. I will focus on two key areas: representation and scale.