Statistics I - Unit Wise Questions

Unit 1: Introduction
4 Questions

4. Explain the role of statistics in computer science and information technology.

5 marks | Asked in Model Question

13. State with suitable examples the role played by computer technology in applied statistics and the role of statistics in information technology.

5 marks | Asked in 2078

12. Define primary data and secondary data and explain the difference between them.

5 marks | Asked in 2075

12. What do you mean by measurement scale? Describe the different types of measurement scales used in statistics.

5 marks | Asked in 2076

Unit 2: Descriptive Statistics
13 Questions

1. What are different methods of measuring dispersion. Sample of polythene bags from two manufactures, A, B are tested by a prospective buyer for bursting pressure and the results are as follows.


Which set of bags has more uniform pressure? If price are the same, which manufacture's bags would be preferred by buyer? Use appropriate statistical tool.

10 marks | Asked in 2078

2. Explain how box-plot is helpful to know the shape of the data distribution. The following data set represents the number of new computer accounts registered during ten consecutive days.


a) Compute the mean, median, quartiles, and sample standard deviation.

b) Check whether there are outliers or not.

c) If outliers are present, then delete the detected outliers and compute the mean, median, quartiles, and sample standard deviation again.

d) Make your conclusion about the effect of outliers on descriptive statistical analysis.

10 marks | Asked in Model Question

1.  Distinguish between absolute and relative measure of dispersion. Two computer manufacturers A and B compete for profitable and prestigious contract. In their rivalry, each claim that their computer a consistent. For this it was decided to start execution of the same program simultaneously on 50 computers of each company and recorded the time as given below.

Which company's computer is more consistent?

10 marks | Asked in 2075

1. What are the roles of measure of dispersion in descriptive statistics? Following table gives the frequency distribution of thickness of computer chips (in nanometer) manufactured by two companies.


Which company may be considered more consistent in terms of thickness of computer chips? Apply appropriate descriptive statistics.

10 marks | Asked in 2076

4. If 50 image of your website, 10 have black and white image, and their average scanned image occupies with 2.5 megabytes of memory. The total image occupies by the entire work 281 megabytes. Find the average occupies megabytes of those color images.

5 marks | Asked in 2078

5. Following table presents some descriptive statistics computed from three different independent sample dataset(X).


a) Compare sample mean and median, and explain about the shape of the data distribution for each dataset. Compare the variability of the three set of dataset. Box-plots have been generated through SPSS for each dataset as follows.


b) Do these box-plots support your findings obtained in a) about the shape of the distribution? Explain.

5 marks | Asked in Model Question

5. Calculate Q1, D7, and P58 from the following data and interpret the results.


5 marks | Asked in 2078

4. The following table gives the installation time (in minutes) for hardware on 50 different computers.


    If the average installation time is 30.2 minutes, find missing frequencies.

5 marks | Asked in 2076

4. Measurement of computer chip's thickness (in monometers) is recorded below.


    Find the mode of thickness of computer chips and interpret the result.

5 marks | Asked in 2075

5. The length of power failure in minute are recorded in the following table.


Find Q3, D2 and P40 and interpret the results

5 marks | Asked in 2076

5. Calculate Q3, D6, and P80 from the following data and interpret the results.


5 marks | Asked in 2075

9. Compute first four moments about arbitrary point 4 from following distribution and describe the characteristics of data.

    

5 marks | Asked in 2078

10. Compute percentile coefficient of kurtosis from the following data and interpret the result.


5 marks | Asked in 2075

Unit 3: Introduction to Probability
4 Questions

1. A new computer program consists of two modules. The first module contains an error with probability 0.2. The second module is more complex; it has a probability of 0.4 to contain an error, independently of the first module. An error in the first module alone causes the program to crash with probability 0.5. For the second module, this probability is 0.8. If there are errors in both modules, the program crashes with probability 0.9. Suppose the program crashed. What is the probability of errors in both modules?

10 marks | Asked in Model Question

6. A large chain retailer purchases a certain kind of electronic device from a manufacturer. The manufacturer indicates that the defective rate of the device is 3%.

a) The inspector randomly picks 20 items from a shipment. What is the probability that there will be at least one defective item among these 20?

b) Suppose that the retailer receives 10 shipments in a month and the inspector randomly tests 20 devices per shipment. What is the probability that there will be exactly 3 shipments each containing at least one defective device among the 20 that are selected and tested from the shipment?

5 marks | Asked in Model Question

6. A manufacturing company employs three analytical plans for the design and development of a particular product. For cost reasons, all three are used at varying times. In facts, plan 1,2 and 3 are used for 30%, 20% and 50% of the products respectively. The defect rate in different procedures is as follows: P(D/P1) = 0.01, P(D/P2) = 0.03, P(D/P3)= 0.02, where P(D/PJ) is the probability of a defective product, given plan j. If a random product was observed and found to be defective, which plan was most likely used and thus responsible?

5 marks | Asked in 2076

9. A large chain retailer purchases a certain kind of electronic device from a manufacturer. The manufacturer indicates that the defective rate of the device is 15%. The inspector randomly picks 10 items from a shipment. What is the probability that there will be at least one defective item among these 10? 

5 marks | Asked in 2076

Unit 4: Sampling
4 Questions

12. What do you mean by sampling? Explain the difference between stratified sampling and cluster sampling.

5 marks | Asked in 2078

12. Write short notes on the following.

    a) Sampling error and non-sampling error

    b) Conditional probability

5 marks | Asked in Model Question

13. What do you mean by sampling? Explain non probability sampling with merits and demerits.

5 marks | Asked in 2075

13. What is sampling? Discuss various probability sampling techniques with merits and demerits.

5 marks | Asked in 2076

Unit 5: Random Variables and Mathematical Expectation
2 Questions

7. The random variable X has following probability distribution.


Find (i) E(X) and var(X) (ii) Calculate E(Y) if Y = 3X + 5.

5 marks | Asked in 2076

11. The lifetime of a certain electronic component is a normal random variate with the expectation of 5000 hours and a standard deviation of 100 hours. Compute the probabilities under the following conditions

a) Lifetime of components is less than 5012 hours

b) Lifetime of components between 4000 to 6000 hours

c) Lifetime of components more than 7000 hours

5 marks | Asked in Model Question

Unit 6: Probability Distributions
16 Questions

3. (a) What do you understand by Poisson distribution? What are its main features?

    (b) What do you mean by joint probability distribution function? Write down its properties.

10 marks | Asked in 2078

3.  (a). What do you understand by binomial distribution? What are its main features?

    (b). What do you mean by marginal probability distribution? Write down its properties.

10 marks | Asked in 2075

3. (a) Define Normal distribution. What are the main characteristics of a Normal distribution?

    (b) What do you mean by probability density function? Write down its properties.

10 marks | Asked in 2076

6. The following joint probability data apply to fatigue test to be run on bronze strips. X represent to failure (in 105) when alternate strips are bent at a high level of deflection. Y represent the same at a lower deflection level.

    

    a) Find the marginal probability distribution for X and for Y.

    b) Determine the conditional probability distribution of Y given X=5.

    c) Are X and Y independent?

5 marks | Asked in 2078

7. Messages arrive at an electronic message center at random times, with an average of 9 messages per hour.

a) What is the probability of receiving at least five messages during the next hour?

b) What is the probability of receiving exactly seven messages during the next hour?

5 marks | Asked in Model Question

7. Fit a binomial distribution to the following data.


5 marks | Asked in 2078

8. The time, in minutes, it takes to reboot a certain system is a continuous variable with the density function:

Compute C, and then compute the probability that it takes between 1 and 2 minutes to reboot.

5 marks | Asked in Model Question

8. If two random variables have the joint probability density function


Find (i) constant k    (ii) Conditional probability density function of X given Y    (iiI) identify whether X and Y are independent.

5 marks | Asked in 2078

6. Define a random variable. For the following bi-variants probability distribution of X and Y , find

    [i] marginal probability mass function of X and Y ,

    [ii] P(x≤1, Y=2),

    [iii] P(X≤1)


5 marks | Asked in 2075

10. The lifetime of a certain electronic component is a normal random variate with the exception of 5000 hours and a standard deviation of 100 hours. Compute the probabilities under the following conditions:

    a) Lifetime of components is less than 4000 hours

    b) Lifetime of components between 3000 to 6500 hours

    c) Lifetime of components more than 6000

5 marks | Asked in 2078

10. Define exponential distribution with parameter Î» . The time required to reach to the printer after ordering in the computer follows exponential distribution at an average rate of 3 jobs per hour.

a) What is the expected time between jobs?

b) What is the probability that the next job is sent within 5 minutes?

5 marks | Asked in Model Question

7. If two random variables have the joint probability density function


Find (i) constant k (ii) Conditional probability density function of X and given Y (iii) Var(3X + 2Y)

5 marks | Asked in 2075

8. If two random variables have the joint probability density function


Find (i) constant k (ii) Conditional probability density function of x given Y (iii) Identify whether X and Y are independent.

5 marks | Asked in 2076

8. A certain machine makes electrical resistors having mean resistance of 40 ohms and standard deviations of 2 ohms. Assuming that the resistance follows a normal distribution.

    (i) What percentage of resistors will have a resistance exceeding 43 ohms?

    (ii) What percentage of registors will have a resistance between 30 ohms to 45 ohms?

5 marks | Asked in 2075

10. Messages arrive at an electronic message center at random times, with an average of 9 messages per hour.

    a) What is the probability of receiving at least four messages during the next hour?

    b) What is the probability of receiving at most three messages during the next hour?

5 marks | Asked in 2076

11. Write the properties of Poisson distribution. Fit a poision distribution and find the expected frequencies.


5 marks | Asked in 2075

Unit 7: Correlation and Linear Regression
8 Questions

2. Write the properties of correlation coefficient. The time it takes to transmit a file always depends on the file size. Suppose you transmitted 30 files, with the average size of 126K bytes and the standard deviation of 35 Kbytes. The average transmitted time was 0.04 seconds with the standard deviation 0.01 seconds. The correlation coefficient between the time and size was 0.86. Based on these data, fit a linear regression model and predict the time it will take to transmit a 400K byte file.

10 marks | Asked in 2078

3. A computer manager interested to know how efficiency of his/her new computer program which depends on the size of incoming data. Efficiency will be measured by the number of processed requests per hour. In general, larger data sets require more computer time, and therefore, fewer requests are processed within 1 hour. Applying the program to data sets of different sizes, the following data were gathered.


a) Identify which one response variable, and fit a simple regression line, assuming that the relationship between them is linear.

b) Interpret the regression coefficient with reference to your problem.

c) Obtain coefficient of determination, and interpret this.

d) Based on the fitted model in (a), predict the efficiency of new computer for data size 12(gigabytes). Does it possible to predict efficiency for data size of 30 gigabytes? Discuss.

10 marks | Asked in Model Question

2. In a certain type of metal test specimen. the effect of normal stress on a specimen is known to be functionally related to shear resistance. The following table gives the data on the two variables.


    (i). Identify which one is response variable, and fit a simple regression line, assuming that the relationship between them is linear.

    (ii). Interpret the regression coefficient with reference to your problem.

    (iii). Obtain the coefficient of determination, and interpret this.

    (iv). Based on the fitted model in (a), predict the shear resistance for normal stress of 30 kilogram per square centimeter .

10 marks | Asked in 2075

2. A study was done to study the effect of ambient temperature on the electric power consumed by a chemical plant. Following table gives the data which are collected from an experimental pilot plant. 

    (i) Identify which one is response variable, and fit a simple regression line, assuming that the relationship between them is linear.

    (ii) Interpret the regression coefficient with reference to your problem.

    (iii) Obtain coefficient of determination, and interpret this.

    (iv) Based on the fitted model in (a), predict the power consumption for an ambient temperature of 650F.

10 marks | Asked in 2076

9. Following data represent the preference of 10 students studying B.Sc .(CSIT) towards two brands of computers namely DELL and HP.

Apply appropriate statistical tool to measure whether the brand preference is correlated. Also interpret your result.

5 marks | Asked in Model Question

11. Calculate Spearman's rank correlation coefficient for the following ranks given by three judges in a music contest.

Indicate which pair of judges has the nearest approach to music.

5 marks | Asked in 2078

9. As part of the study of the psychobiological correlates of success in athletes, the following measurements are obtained from members of Nepal national football team.


    Calculate Spearman's rank correlation coefficient.

5 marks | Asked in 2075

11. Following data represent the preference of 10 students studying B.Sc.(CSIT) towards two brands of computers namely DELL and HP.


    Apply appropriate statistical tool to measure whether the brand preference is correlated. Also interpret your result.

5 marks | Asked in 2076