main content

bootstrap using chain ladder method -凯发k8网页登录

this example shows how to apply a chain ladder bootstrap method to generate several objects to estimate the ultimate claims.

deterministic claim estimation methods produce point estimates of reserve values with no information about the uncertainty of these estimates. the goal of a stochastic claim estimation method is to assess the variability of estimated reserve values. the chain ladder bootstrapping approach is a simulation-based method to randomly modify the data and produce a distribution of estimated reserves that represents the variability of the estimated reserve values. this example is based on the work of wüthrich and merz [1].

load data

load('insuranceclaimsdata.mat');
disp(head(data));
    originyear    developmentyear    reportedclaims    paidclaims
    __________    _______________    ______________    __________
       2010             12               3995.7          1893.9  
       2010             24                 4635          3371.2  
       2010             36               4866.8          4079.1  
       2010             48               4964.1            4487  
       2010             60               5013.7          4711.4  
       2010             72               5038.8          4805.6  
       2010             84                 5059          4853.7  
       2010             96               5074.1          4877.9  

create developmenttriangle

create a object and use to visualize the developmenttriangle. for more information on unpaid claims estimation, see .

dtriangle = developmenttriangle(data);
dtriangletable = view(dtriangle);
% visualize the development triangle
claimsplot(dtriangle)

figure contains an axes object. the axes object with title cumulative claims development, xlabel development period, ylabel claims contains 10 objects of type line. these objects represent 2010, 2011, 2012, 2013, 2014, 2015, 2016, 2017, 2018, 2019.

analyze the developmenttriangle

the link ratios are estimated using the formula:

fjˆ=i=0i-j-1ci,j 1i=0i-j-1ci,j

use to calculate the age-to-age factors.

factorstable = linkratios(dtriangle);

use to calculate the averages of the age-to-age factors.

averagefactorstable = linkratioaverages(dtriangle);
disp(averagefactorstable);
                                          12-24     24-36     36-48     48-60     60-72     72-84     84-96    96-108    108-120
                                          ______    ______    ______    ______    ______    ______    _____    ______    _______
    simple average                        1.1767    1.0563    1.0249    1.0107    1.0054    1.0038    1.003    1.002      1.001 
    simple average - latest 5              1.172     1.056    1.0268    1.0108    1.0054    1.0038    1.003    1.002      1.001 
    simple average - latest 3               1.17    1.0533     1.027    1.0117    1.0057    1.0037    1.003    1.002      1.001 
    medial average - latest 5x1           1.1733    1.0567    1.0267    1.0103     1.005     1.004    1.003    1.002      1.001 
    volume-weighted average               1.1766    1.0563     1.025    1.0107    1.0054    1.0038    1.003    1.002      1.001 
    volume-weighted average - latest 5     1.172     1.056    1.0268    1.0108    1.0054    1.0038    1.003    1.002      1.001 
    volume-weighted average - latest 3    1.1701    1.0534     1.027    1.0117    1.0057    1.0037    1.003    1.002      1.001 
    geometric average - latest 4            1.17     1.055    1.0267     1.011    1.0055    1.0037    1.003    1.002      1.001 

display the selected age-to-age factors table and calculate the cumulative development factor (cdf) using .

dtriangle.selectedlinkratio = averagefactorstable{'volume-weighted average',:};
currentselectedfactors = dtriangle.selectedlinkratio;
dtriangle.tailfactor = 1;
selectedfactorstable = cdfsummary(dtriangle);
disp(selectedfactorstable);
                                12-24      24-36      36-48      48-60      60-72     72-84      84-96     96-108     108-120    ultimate
                               _______    _______    _______    _______    _______    ______    _______    _______    _______    ________
    selected                    1.1766     1.0563      1.025     1.0107     1.0054    1.0038      1.003      1.002     1.001        1    
    cdf to ultimate             1.3072      1.111     1.0518     1.0261     1.0153    1.0098      1.006      1.003     1.001        1    
    percent of total claims    0.76501    0.90008    0.95075    0.97453    0.98496    0.9903    0.99402    0.99701     0.999        1    

display the latest diagonal.

latestdiagonal = dtriangle.latestdiagonal;

compute the projected ultimate claims using .

projectedultimateclaims = ultimateclaims(dtriangle);

display the full development triangle using .

fulltriangletable = fulltriangle(dtriangle);
disp(fulltriangletable);
              12        24        36        48        60        72        84        96       108       120      ultimate
            ______    ______    ______    ______    ______    ______    ______    ______    ______    ______    ________
    2010    3995.7      4635    4866.8    4964.1    5013.7    5038.8      5059    5074.1    5084.3    5089.4     5089.4 
    2011      3968    4682.3    4963.2    5062.5    5113.1    5138.7    5154.1    5169.6    5179.9    5185.1     5185.1 
    2012      4217    5060.4      5364    5508.9    5558.4    5586.2    5608.6    5625.4    5636.7    5642.3     5642.3 
    2013    4374.2    5205.3    5517.7    5661.1    5740.4    5780.6    5803.7    5821.1    5832.7    5838.6     5838.6 
    2014    4499.7    5309.6    5628.2    5785.8    5849.4    5878.7    5900.8    5918.5    5930.3    5936.3     5936.3 
    2015    4530.2    5300.4    5565.4    5715.7    5772.8    5804.1    5825.9    5843.4    5855.1      5861       5861 
    2016    4572.6    5304.2    5569.5    5714.3    5775.4    5806.7    5828.6    5846.1    5857.7    5863.6     5863.6 
    2017    4680.6    5523.1    5854.4    6000.9    6065.1      6098    6120.9    6139.3    6151.6    6157.7     6157.7 
    2018    4696.7    5495.1    5804.4    5949.6    6013.3    6045.9    6068.6    6086.8      6099    6105.1     6105.1 
    2019    4945.9    5819.2    6146.7    6300.5    6367.9    6402.4    6426.5    6445.8    6458.7    6465.2     6465.2 

compute the total reserves using .

ibnr = ultimateclaims(dtriangle) - dtriangle.latestdiagonal;
ibnr = array2table(ibnr, 'rownames', dtriangletable.properties.rownames, 'variablenames', {'ibnr'});
ibnr{'total',1} = sum(ibnr{:,:});
disp(ibnr);
              ibnr 
             ______
    2010          0
    2011     5.1857
    2012      16.89
    2013     34.886
    2014     57.583
    2015     88.148
    2016     149.34
    2017     303.29
    2018     609.99
    2019     1519.3
    total    2784.6

bootstrap chain ladder

to derive the resampling approaches, the time series model of the distribution-free chain ladder (cl) model is defined as:

ci,j 1=fjci,j σjci,jϵi,j 1

for the link ratio selected above, wüthrich [1] and mack [2] show that the standard deviation is estimated as:

σjˆ2=1i-j-1i=0i-j-1ci,j(ci,j 1ci,j-fjˆ)2

σj-1ˆ2=min{σj-2ˆ4σj-3ˆ3;σj-3ˆ2;σj-2ˆ2}

estimatedstandarddeviations = currentselectedfactors;
for i=1:width(estimatedstandarddeviations)-1
    estimatedstandarddeviations(1,i) = sqrt(sum(((factorstable{1:end-i,i} - currentselectedfactors(:,i)).^2).*dtriangletable{1:end-i,i}) / (height(dtriangletable)-i-1));
end
estimatedstandarddeviations(1,end) = sqrt(min([estimatedstandarddeviations(1,end-1)^4 / estimatedstandarddeviations(1,end-2)^2, estimatedstandarddeviations(1,end-2)^2, estimatedstandarddeviations(1,end-1)^2]));
disp(estimatedstandarddeviations);
    0.8667    0.3699    0.2420    0.1310    0.0673    0.0361    0.0001    0.0001    0.0001

to apply the bootstrap method, you need to find the appropriate residuals that allow for the construction of the empirical distribution fnˆ to construct the bootstrap observations.

consider the following residuals for i ji,j1.

ϵi,j=fi,j-fj-1ˆσj-1ci,j-1-1/2 where fi,j=ci,jci,j-1

following wüthrich [1], you can scale the residuals to adjust their variance upwards. unscaled residuals tend to result in lighter tails in the simulated distribution.

adjust the residuals such that the bootstrap distribution has an adjusted variance function.

zi,j=(1-ci,j-1i=0i-jci,j-1)-12fi,j-fj-1ˆσj-1ˆci,j-1-12

you can apply the bootstrap algorithm using three different versions:

  • efron's nonparametric bootstrap for residuals ϵi,j

  • efron's nonparametric bootstrap for scaled residuals zi,j

  • parametric bootstrap under the assumption that the residuals have a standard gaussian distribution, that is zi,j* is resampled from n(0,1)

this example uses the second version (efron's nonparametric bootstrap for scaled residuals) to calculate zi,j.

% create a copy of the factors table and modify it to create the
% residuals table
residuals = factorstable.variables;
colsums = sum(dtriangle.claims,'omitnan');
for i=1:height(residuals)
    for j=1:width(residuals)   
        residuals(i,j) = (1 - (dtriangletable{i,j}/colsums(j)))^-0.5 * (factorstable{i,j} - currentselectedfactors(1,j)) / (estimatedstandarddeviations(1,j)*(dtriangletable{i,j}^-0.5));
    end
end

the residuals {zi,j,i ji} define a bootstrap distribution.

residualsvector = residuals(:);
residualsvector(isnan(residualsvector)) = [];
histogram(residualsvector,10)
title('scaled residuals')
xlabel('residual value')
ylabel('frequency')

figure contains an axes object. the axes object with title scaled residuals, xlabel residual value, ylabel frequency contains an object of type histogram.

to simulate a new reserves scenario with the bootstrap method, follow these steps.

step 1: resample a triangle of residuals from the bootstrap distribution.

resample the independent and identically distributed (i.i.d.) residuals{z*i,j,i ji} from the bootstrap distribution.

resampledresiduals = residuals;
rng('default');
rng(1);
for i = 1:height(residuals)-1
    for j = 1:width(residuals)-i 1
        resampledresiduals(i,j) = datasample(residuals(~isnan(residuals)), 1);
    end
end
disp(resampledresiduals);
   -1.5522   -0.5120   -1.2668    0.7776   -1.3649    0.2799   -0.5495   -1.3146   -1.5364
   -0.4041   -1.5522   -0.4784   -1.2189   -0.7591    0.2610   -0.4784   -1.5522       nan
   -0.4091   -1.3649   -0.5495   -1.6767   -0.8571   -1.3143   -0.4879       nan       nan
   -0.7591    1.3226    1.0791    0.2610    0.2861   -0.7591       nan       nan       nan
    0.2799   -1.5522   -0.8571    0.3243   -0.4879       nan       nan       nan       nan
   -1.3143   -0.4784    0.5556   -1.2668       nan       nan       nan       nan       nan
    1.9550         0    1.9550       nan       nan       nan       nan       nan       nan
    0.7693    0.5169       nan       nan       nan       nan       nan       nan       nan
    0.2799       nan       nan       nan       nan       nan       nan       nan       nan
       nan       nan       nan       nan       nan       nan       nan       nan       nan

step 2: compute bootstrapped claims.

define ci,0*=ci,0 and, for j1, assume that:

ci,j*=fj-1ˆci,j-1* σj-1ˆci,j-1*zi,j*

this expression represents the new simulated claim values. using the simulated claim values, you can create a new to estimate new reserve values.

bootstrappedclaims = dtriangletable.variables;
for j = 2:width(bootstrappedclaims)
    bootstrappedclaims(:,j) = currentselectedfactors(1,j-1).*bootstrappedclaims(:,j-1)   estimatedstandarddeviations(1,j-1).*sqrt(bootstrappedclaims(:,j-1)).*resampledresiduals(:,j-1);
end
stackedclaims = reshape(bootstrappedclaims',100,1);
stackedclaims = stackedclaims(~isnan(stackedclaims));
newdata = data;
newdata.values = stackedclaims;
bootstrappeddevelopmenttriangle = developmenttriangle(newdata,'claims','values');

step 3: select a link ratio consistent with the model.

the volume-weighted average is the link ratio that is consistent with the model used in this bootstrap approach.

bootstrappedaveragefactorstable = linkratioaverages(bootstrappeddevelopmenttriangle);
bootstrappeddevelopmenttriangle.selectedlinkratio = bootstrappedaveragefactorstable{'volume-weighted average',:};
bootstrappeddevelopmenttriangle.tailfactor = 1;
bootstrappedselectedfactorstable = cdfsummary(bootstrappeddevelopmenttriangle);
disp(bootstrappedselectedfactorstable);
                                12-24      24-36     36-48      48-60      60-72     72-84      84-96     96-108     108-120    ultimate
                               _______    _______    ______    _______    _______    ______    _______    _______    _______    ________
    selected                    1.1751      1.054    1.0253     1.0099     1.0048    1.0036      1.003      1.002     1.001        1    
    cdf to ultimate              1.301     1.1072    1.0504     1.0245     1.0145    1.0096      1.006      1.003     1.001        1    
    percent of total claims    0.76861    0.90321     0.952    0.97609    0.98572    0.9905    0.99403    0.99701     0.999        1    

use to display the full development triangle corresponding to the selected link ratio.

bootstrappedfulltriangle = fulltriangle(bootstrappeddevelopmenttriangle);
disp(bootstrappedfulltriangle);
              12        24        36        48        60        72        84        96       108       120      ultimate
            ______    ______    ______    ______    ______    ______    ______    ______    ______    ______    ________
    2010    3995.7    4616.2    4863.2    4963.4    5023.7    5044.5    5064.1    5079.3    5089.5    5094.6     5094.6 
    2011      3968    4646.6      4869    4982.8    5024.8    5048.4    5068.1    5083.3    5093.4    5098.5     5098.5 
    2012      4217    4938.6    5181.1    5301.1    5341.9    5366.6    5383.3    5399.5    5410.2    5415.6     5415.6 
    2013    4374.2    5103.1    5425.3    5580.2    5642.5    5674.5    5693.8    5710.9    5722.3      5728       5728 
    2014    4499.7    5310.5    5567.5    5691.3    5755.4    5784.2    5804.8    5822.2    5833.8    5839.6     5839.6 
    2015    4530.2    5253.5    5536.3    5684.8    5733.2      5761    5781.5    5798.8    5810.4    5816.2     5816.2 
    2016    4572.6    5494.6    5803.9    5985.1    6044.2    6073.5    6095.1    6113.4    6125.6    6131.7     6131.7 
    2017    4680.6    5552.6    5879.4    6028.2    6087.7    6117.2      6139    6157.4    6169.7    6175.9     6175.9 
    2018    4696.7    5542.6      5842    5989.8    6048.9    6078.2    6099.9    6118.2    6130.4    6136.5     6136.5 
    2019    4945.9      5812      6126      6281      6343    6373.7    6396.4    6415.6    6428.4    6434.8     6434.8 

step 4: compute the total reserves.

compute the total reserves from the simulated .

bootstrappeddevelopmenttriangletable = view(bootstrappeddevelopmenttriangle);
bootstrappedibnr = ultimateclaims(bootstrappeddevelopmenttriangle) - bootstrappeddevelopmenttriangle.latestdiagonal;
bootstrappedibnr = array2table(bootstrappedibnr, 'rownames', bootstrappeddevelopmenttriangletable.properties.rownames, 'variablenames', {'ibnr'});
bootstrappedibnr{'total',1} = sum(bootstrappedibnr{:,:});
disp(bootstrappedibnr);
              ibnr 
             ______
    2010          0
    2011     5.0881
    2012     16.188
    2013     34.197
    2014     55.485
    2015     83.048
    2016     146.61
    2017     296.45
    2018     593.94
    2019       1489
    total      2720

you can repeat the previous steps many times to genreate a full, simulated, distribution of reserves. the simulation produces reserves for each year and for the total reserves.

simulate multiple bootstrapped scenarios

create 1000 bootstrapped development triangles and calculate the incurred-but-not-reported (ibnr) for each .

n = 1000; 
simulatedibnr = zeros(10,n);
for i = 1:n
    simulatedresiduals = residuals;
    
    for j = 1:height(residuals)-1
        for k = 1:width(residuals)-j 1
            simulatedresiduals(j,k) = datasample(residuals(~isnan(residuals)),1);
        end
    end
    
    simulatedclaims = dtriangletable.variables;
    for j = 2:width(simulatedclaims)
        simulatedclaims(:,j) = currentselectedfactors(1,j-1).*simulatedclaims(:,j-1)   estimatedstandarddeviations(1,j-1).*sqrt(simulatedclaims(:,j-1)).*simulatedresiduals(:,j-1);
    end
        
    simulatedclaims = reshape(simulatedclaims',100,1);
    simulatedclaims = simulatedclaims(~isnan(simulatedclaims));
    simulateddata = data;
    simulateddata.reportedclaims = simulatedclaims;
    simulateddevelopmenttriangle = developmenttriangle(simulateddata);
    
    simulatedaveragefactorstable = linkratioaverages(simulateddevelopmenttriangle);
    simulateddevelopmenttriangle.selectedlinkratio = simulatedaveragefactorstable{'volume-weighted average',:};
    simulateddevelopmenttriangle.tailfactor = 1;
    simulatedlatestdiagonal = simulateddevelopmenttriangle.latestdiagonal;
    simulatedprojectedultimateclaims = ultimateclaims(simulateddevelopmenttriangle);
    simulatedibnr(:,i) = simulatedprojectedultimateclaims - simulatedlatestdiagonal;
    
end
simulatedibnr(end 1,:) = sum(simulatedibnr);

select a year to plot the distribution of the ibnr, calculate the mean, and compare that mean to a calculated deterministic value.

originyear =5;
histogram(simulatedibnr(originyear 1,:));
hold on;
plot(mean(simulatedibnr(originyear 1,:)),0,'o','linewidth',2)
plot(ibnr{originyear 1,1},0,'x','linewidth',2);
legend('simulated ibnr',['simulated mean : ' num2str(round(mean(simulatedibnr(originyear 1,:)),2))],['deterministic ibnr : ' num2str(round(ibnr{originyear 1,1},2))]);
hold off;

figure contains an axes object. the axes object contains 3 objects of type histogram, line. one or more of the lines displays its values using only markers these objects represent simulated ibnr, simulated mean : 88.91, deterministic ibnr : 88.15.

plot a histogram of the totals for ibnrs, simulated means, and deterministic values.

histogram(simulatedibnr(11,:));
hold on;
plot(mean(simulatedibnr(11,:)),0,'o','linewidth',2)
plot(ibnr{11,1},0,'x','linewidth',2);
legend('simulated total ibnr',['simulated mean : ' num2str(round(mean(simulatedibnr(11,:)),2))],['deterministic total ibnr : ' num2str(round(ibnr{11,1},2))]);
hold off;

figure contains an axes object. the axes object contains 3 objects of type histogram, line. one or more of the lines displays its values using only markers these objects represent simulated total ibnr, simulated mean : 2785.98, deterministic total ibnr : 2784.57.

references

  1. wüthrich, mario, and michael merz. stochastic claims reserving methods in insurance. hoboken, nj: wiley, 2008

  2. mack, thomas. "distribution-free calculation of the standard error of chain ladder reserve estimates." astin bulletin. vol. 23, no. 2, 1993.

see also

| | | | | | | | | | | |

related topics

    网站地图