European motor TPL insurance
euMTPL.RdThe euMTPL compiles three years of experience from a European MTPL (Motor Third Party Liability) portfolio,
including frequency and severity values for different types of losses. The data was collected during the
first decade of the 21st century.
Usage
data(euMTPL)Format
euMTPL is a data frame with 2,373,197 rows and 19 columns:
policy_idUnique identifier for each policy.
yearCalendar year of the policy.
groupData split into training, validation, and test sets using a 70/10/20 ratio.
fuel_typeFuel type of the insured vehicle.
vehicle_categoryCategory of the insured vehicle.
vehicle_useIntended use of the vehicle (e.g., personal, commercial).
provinceProvince of residence of the policyholder.
horsepowerPower output of the insured vehicle, measured in horsepower.
genderGender of the policyholder.
ageAge of the policyholder at the start date of the policy.
exposureFraction of the year that the policy was in effect.
cost_ncTotal claim amount for No Card (NC) claims.
num_ncNumber of No Card (NC) claims.
cost_cgTotal claim amount for Card Gestionario (CG) claims.
num_cgNumber of Card Gestionario (CG) claims.
cost_cdTotal claim amount for Card Debitore (CD) claims.
num_cdNumber of Card Debitore (CD) claims.
cost_fcdTotal claim amount for Forfait Card Gestionario (FCD) claims.
num_fcdNumber of Forfait Card Gestionario (FCD) claims.
Examples
# (1) load of data
#
data(euMTPL)
head(euMTPL)
#> # A tibble: 6 × 19
#>   policy_id group fuel_type  year vehicle_category vehicle_use province
#>       <int> <chr> <fct>     <dbl> <fct>            <fct>       <fct>   
#> 1         1 test  B             7 1                1           PA      
#> 2         2 train B             7 1                1           NA      
#> 3         4 train B             7 1                1           CN      
#> 4         5 train B             7 1                1           NA      
#> 5         6 train B             7 1                1           NA      
#> 6         8 train B             7 1                1           NA      
#> # ℹ 12 more variables: horsepower <int>, gender <fct>, age <int>,
#> #   exposure <dbl>, cost_nc <dbl>, num_nc <int>, cost_cg <dbl>, num_cg <int>,
#> #   cost_fcg <dbl>, num_fcg <int>, cost_cd <dbl>, num_cd <int>