Tuesday, June 4, 2019
History Of Weibull Distribution
History Of Weibull DistributionIn fortune theory and statistics, the Weibull dispersal is one of the well-nigh important ceaseless probability disseminations. It was, first introduced by W. Weibull in 1939 when he was studying the issue of structural strength and life info analysis, and was formally named after him later in 1951. He proposed the string model to explain the structural strength. Based on the assurance that a structure is composed of several small components (n pieces) in series, we could get wind the structure as being composed of an n-rings chain, the strength of which (or life) completely depends on the weakest rings strength (or life). In his model, with the guess that the strength of opposite rings argon autarkic and identically distributed, finding the strength scattering of the chain fashion the problem of finding the scattering of the weakest ring.Due to the result of research conducted by Gnedenko (1943), no matter what the original disseminati on of the variable is, the asymptotic distribution of the minimum could only be three different forms. The Weibull distribution is one of them.Since Weibull distribution is ceremonious on the weakest link model, which could sufficiently reflect the defect of worldly and the effects of dialect concentration, it has been considered as train aside model to describe strength of fiber substantial in practical screening.1.2 Stress-Strength AnalysisStress-Strength Analysis is the analysis of the survival of materials subjected to a random stress. Stress-Strength Analysis is a commonly social occasiond tool in reliableness engineering.The stress-strength reliability is given byWhere is the pdf of the strength distribution and is the cumulative distribution function of the stress distribution.In this case, the data for the strength set would be veritable data that is indicative of the strength of the material (i.e. maximum applied stress to cause failure), and the stress data would be actual stress data of the material under use conditions.1.3 Research Motivation TargetJudging from material of coulomb fibers, microscopic flaws cause bendable failure to occur below the native strength of the fiber. Therefore, analysis of tensile failure data should provide insight into the distribution of flaws within fiber. The fiber might be viewed as a chain of interlocking links, with the flaws acting as weak links in the chain. As the same model mentioned previously, the chain provide fail at its weakest link. Usually tensile data do not confirm to rigid statistical distributions. Therefore, they cannot be represented by the specific fixed statistical distribution accurately. Instead, it is necessary to use a flexible distribution that enables the cause of the distribution function to be altered by the data itself. Under this situation, the Weibull distribution is one the natural choice.Although found on the theoretical model, Weibull distribution should be an appr opriate description strength distribution for carbon fibers. However, judging from experiential data, it always suggests that the Weibull distribution is not an appropriate model for carbon fibers. One plausible problem is validity of independency assumption in the chain of links model. In that model, we submit a physical system is consisted of n identical units or items connected in series and for each one unit should be independently and identically distributed. However, in practice, this strict assumption cannot be guaranteed, which will cause the discrepancy of between real strength and Weibull distribution. For example, if there is a defect hole on n-th link, this defect hole will probably expend to adjacent link(n-1-th and n+1-th). That causes the problem of dependency.Another popular plausible explanation is clamp effects which is analysed in detail by capital of Arizona and Sexsmith. In building upon this work, Stoner developed new end-effect Weibull model, in which dist inct Weibull distributions were utilise to characterize failures from true flaws and from artifacts in carbon fibers.Although, to some extent, the end-effect Weibull model accurately descripts the data upon which it was based, it provokes our matter tos in pre-stress issue applied on carbon fiber forrader it got foot raceed. Because in practice, such as in the strength rill, fiber always fit stress before they can be tested. For example, in the popular experiment proposed by Bader Pries, there is no guaranty that single fiber could be stately without any slight press of damage. Thus, in that experiment, they stress the fibers before they get tested, which will cause the strength test result representing all fibers performance survived from pre-stress.This interrogative sentence arise our interest in study of pre-stress Weibull distribution. In the problem we will discuss in this thesis, we assume that original strength of fiber is Weibull distributed. Therefore, if there is no pre-stress applied on fiber material, the final strength will be also Weibull distributed without any changes from origins. However, judging from common sense, pre-stress could not be avoided exclusively before the strength test. They usually could occur in shipping procedure, and pre-test preparation procedure. Our goal is to find out under what conditions these pre-stressed fiber have a (approximately) Weibull distribution.In the stand by chapter, the results of survival fiber strength distribution under multi-type pre-stressed condition have been given out. At same time, a minimum censoring proportion has been set up to assure the pre-stress has a markingificant effect on original fiber material.In the third chapter, MSE is used to measure the fitness of the pre-stressed criminalise essay and nearest Weibull distribution. Moreover, we discuss the goodness of fit test applied to the pre-stressed criminalise sample and Weibull distribution with line of reasonings rank equ al to MLEs from censored sample, which is considered as nearest Weibull distribution based on the censored sample.In the fourth chapter, the simulation results of two methods proposed in the third chapter have been discussed and analyzed.Finally, we will give out a culmination about survival distribution of pre-stress Weibull distribution.Chapter 2Weibull Family and Pre-Stressed Censored Sample2.1 Basic Properties of Weibull DistributionThe probability density function of a Weibull random variable isThe parameter is the shape parameter, is the scale parameter, and is the location parameter of the distribution. When , this reduces to the usual two-parameter Weibull distribution.The Weibull distribution is related to many other probability distributions in cross, it interpolates between the exponential distribution () and the Rayleigh distribution ().The CDFThe cumulative distribution function for the two parameter Weibull distribution isfor , and for .The failure rate (or hazard ra te) is given byThe MeanThe mean, , of Weibull pdf is given byWhere is the da Gamma function evaluated at the value of .The MedianThe median, , is given byThe ModeThe mode, , is given byThe Standard DeviationThe standard deviation, is given byApplicationsThe Weibull distribution has multiple applications in practical world.Survival analysisReliability engineeringWeather forecastingGeneral insurance2.2 Censored Weibull SampleBecause our primary interest is to study strength distribution of fiber material after pre-stressed, we need to generate various censored Weibull sample which is applied multi-type pre-stress.Given and are independent random variables where represent original strength of material and pre-strength individually. So the problem of our interest is what distribution does variable have? To state the problem more clearly, we assume the original strength of material always yield to Weibull distribution family. And the pre-stress yield to three different distribution famil ies which are Weibull, Normal and Gamma. Based on different parameter choice, we will try to find out what determine of the parameters give us that survival strength yield to or approximately yield to Weibull distribution.Mentioned in Chapter 1, we assume is weibull distributed with different shape and scale parameter. In this paper, we will from (pre-stress variable) from three different distribution Weibull distribution, Gamma distribution, and Normal distribution. Also as similar as original strength set, in each distribution option, different parameters are chosen to generate various censored Weibull sample.Meanwhile, since trivial results are not what we expected, a certain censoring ratio has been set up to guarantee there is a significant effect of pre-stress applied on original fiber carbon material. Serving to this purpose, samples with censoring proportion is 0.5 or greater than 0.5 will be kept, which is considered as plausible limit of censoring ratio.Chapter 3 Mean Squ ared faulting and Goodness of Fit TestThe Mean Squared Error (MSE) is a measure of how close fitted curve is to data points. For every data point, the vertical blank from the point to the corresponding value on the fitted curve (the error) will be taken, and the value will be squared. Then those squared values have been added up for all data points, and been divided by the number of points, which is considered as a mean. Since all errors have been switched into positive values on matter what original sign they have, negative values do not cancel positive values. The smaller the Mean Squared Error, the closer the fit curve is to the data points.MSE has been widely used for quantitative performance metric in the field of statistical regression and engineering, such as signal processing.3.1 Comparison of MSE of Censored Weibull SampleBased on the censored Weibull Sample we got, we will calculate out MLE of parameters for Weibull distribution, noted as and , which is considered as nea rest Weibull distribution to censored weibull sample. Since our objective is to measure fitness of censored Weibull sample and nearest Weibull distribution, MSE between of censored Weibull sample and Weibull distribution with parameters and . We establish the MSE as following wayWhere are censored Weibull sample which is sorted as . is empirical CDF at sample point . is defined as sample size. And is CDF of Weibull distribution with parameter , which is MLEs got based on censored Weibull sample.In addition, a service line of MSE comparison is established by calculating MSE between the sample of size from Weibull() and Weibull() distribution.Where consist the sample of size from Weibull(), which is sorted as ascending order . is empirical CDF at sample point . And is CDF of Weibull distribution with parameter .3.2 Comparison via Simulation ResultWe conducted a large-scale simulation study to compare of the performance of contrast between base MSE and sample MSE depends on different censoring stress distribution and parameter chosen. For example, let original strength distribution yields to Weibull(1,1), and censoring stress distribution is Gamma distribution with parameter value chosen from the range 0.25 to 2.5, then we will focus on the performance of fitness of cersored sample to Weibull distribution from various aspects such as censoring proportion, shape parameter value of censoring distribution and scale parameter value of censoring distribution.From above figure, we could not hardly see that the MSE increases with increasing of cersoring proportion, which could be simply interpreted as survival data will go off the original strength distribution when considerable proportion of original strength failed the pre-stress test. Meanwhile, the base MSE keeps relatively stable, since it will not be affected by different pre-stress distribution. Therefore, it is obvious to tell that dissimilarity between MSE and base MSE is raised up when the censoring proporti on increases, which indicates fitness will go worse at same time.The series of comparison MSE chart are arranged by the ascending order of value of Shape Parameter of Gamma Distribution. If we focus on the changing trend of Average leaving between Sample MSE and Base MSE, we could easily figure out that the Average Difference decreases as long as value of shape parameter become smaller and smaller. This shows that fitness of sample is improved by using smaller shape parameter of pre-stress Gamma distribution.On the other side, if we focus on each individual chart, it is not too hard to find that when the value of Scale increases, difference between Sample MSE and Baseline decreases, which indicates fitness is improved by large scale parameter chosen. This phenomenon could be interpreted as larger scale brings in less spread compared with smaller ones create absolute peak in pdf, which breaks the continuity of censored sample.Above figure is overall behavior of MSE based on differ ent choice of shape and scale parameter of pre-stress gamma distribution. To abstract up, In general MSE boosts up where is smaller scale value and larger shape values. Fitness performance indicates censored sample is still Weibull (or approximately Weibull) distributed when shape and scale parameter choice close to minimum and maximum individually.Chapter 4 Fit Study Based on Weibull Goodness-of-Fit TestsIn order to study whether the censored weibull distribution data yields to a weibull distribution, we first generate different censored weibull samples based on different censoring function. Then we use the weibull distribution with parameters value equal to MLE of censored sample as hypothesis. Three Goodness-of-Fit Tests are manipulated to censored weibull samples. We check whether simulation results shows censored weibull sample still yields to a weibull distribution.Goodness-of-fit testsGoodness-of-fit tests for the two-parameter Weibull distributions have displace considerab le attention since its tiny importance. Mann and others (1973), Smith and Bain (1976), Stephens (1977), Littell and others (1979), Chandra and others (1981), Tiku and Singh (1981), Wozniak and Warren (1984), and James and others (1989) have gave out universal discussion of this problem. Mann and others (1973) and Tiku and Singh (1981) proposed new statistics to test the goodness-of-fit of two-parameter Weibull distribution. ), Smith and Bain (1976) proposed a test statistic to test normality which is analogous to the Shapiro-Francis statistic. The Smith and Bain statistic was derived from the sample correlation between the order statistics of a sample and the expected value of the order statistics under the assumption that the sample comes from a two-parameter Weibull distribution. They provided critical values for samples containing 8, 20, 40, 60, 80 observations. Stephens (1977) provided tables of the asymptotic critical values of the Anderson-Darling statistic and the Cramer-von Mises statistic for various significance levels. Littell and others (1979) made a comparison among the Mann, Scheuer, and Fertig statistic, the Smith and Bain statistic, the circumscribed Kolmogorov-Smirnov statistic, the limited Anderson-Darling statistic, and the modified Cramer-von Mises statistics through a various of power studies for sample size n = 10 to 40. Critical values for the , , and statistics for n = 10 to 40 have been calculated and provided at the same time. Chandra and others (1981) calculated critical values for the Kolmogorov-Smirnov statistic for n = 10, 20, 50 and infinity for three situations. James and others produced extensive tables of goodness-of-fit critical values for the two parameter Weibull distributions developed through simulation for the Kolmogorov-Smirnov statistic, the Anderson-Darling statistic, and Shapiro-Wilk-type correlation statistics.Kolmogorov-Smirnov statisticKolmogorov-Smirnov test (Chakravart, Laha, and Roy, 1967) is widely used for comparing a sample with a reference probability distribution (one-sample K-S test), or for comparing two samples (two-sample K-S test). Since it is highly sensitive to distinguish the difference of the empirical cumulative distribution functions of the tested samples, it has been considered as one of the most useful nonparametric methods for comparing two samples. Kolomogorov-Smirnov test is based on the distance between the empirical distribution function of the sample and the cumulative distribution function of the reference distribution, or between the two samples empirical distribution functions.Kolmogorov-Smirnov test has been modified to serve as a goodness of fit test here. The modified Komogorov-Smirnov D statistic is given bywhere is the empirical distribution function of the sample and is the fitted distribution.Anderson-Darling statisticAnderson-Darling test (Stephens, 1974) is used to test whether the data follow a particular distribution. It is named after Theodore Wilb ur Anderson and Donald A. Darling who proposed it in 1952. Anderson-Darling test is based on Kolmogorov-Smirnov test and enjoys sea captain property of more sensitive due to specific distribution application in calculating critical values. Contrarily, its disadvantage is that critical values must be calculated for each distribution.Anderson-Darling test has been modified to serve as a goodness of fit test here. The modified Anderson-Darling statistic is given byWhere and is the th-order statistic.Shapiro-Wilk-Type correlation statisticsIn statistics, the Shapiro-Wilk test is used as the tester of goodness-of-fit test of normal distribution. It has the superior power compared with other statistics in detecting the data comes from a relatively wide range of other distributions for testing goodness-of-fit of normal distributions which has been proposed from Monte-Carlo study of Shapiro and others (1968).Shapiro-Wilk test has been modified to serve as a goodness of fit test here. The s tatistic first proposed by Shapiro and Francia (1972) but with approximate scores suggested by Filliben (1975) and modified form fitting for good-of-fit test for Weibull distributions is given byAndIs the median score, in the spirit of Filliben, except that these scores depend upon the maximum likelihood estimate the Weibull shape parameter .If a variable has the two-parameter Weibull distribution, the variable has an extreme value distribution. Calculating goodness-of-fit on this scale has advantages. Since the extreme value distributions are defined by location and scale parameters, the critical values for the correlation statistic are not dependent on the true shape parameter. Thus, for two-parameter Weibull distribution, we propose the correlation-type statistic , whereAndSimulation Result
Subscribe to:
Post Comments (Atom)
No comments:
Post a Comment
Note: Only a member of this blog may post a comment.