999精品在线视频,手机成人午夜在线视频,久久不卡国产精品无码,中日无码在线观看,成人av手机在线观看,日韩精品亚洲一区中文字幕,亚洲av无码人妻,四虎国产在线观看 ?

The Application of Bagging for Solving Small-Businesses’ Credit Assessment Problems Based on Daily T

2009-04-29 00:00:00DONGYan-wen
中國管理信息化 2009年15期

Abstract: Although many efforts have been made towards assessing credit risk, and various models and methods for credit assessment have been developed, most of them are based on financial data or stock price and some investigation data provided by special inquiry agencies. Because most of minor and small businesses do not disclose their financial information, it is almost impossible to assess their credit using models or methods published so far. Here we have proposed a new approach to assess the customers’ credit only based on daily transaction data such as sales, payments by customers, amount of overdue payment, etc. This paper proposes a credit assessment system using bagging method. It aims to deal with the issue that the number of unsound customers is much less than that of healthy ones and improve the ability for identifying unsound customers. The performance and effectiveness of the proposed system is confirmed by applying it to the real problems.

Key words: Ensembles Bagging; Customer Evaluation; Credit Risk; Credit Scoring

doi:10.3969/j.issn.1673-0194.2009.15.035

CLC number: TP224;F275Article character:AArticle ID:1673-0194(2009)15-0115-05

1 Introduction

In today’s increasingly competitive business environment, risk management has become an important topic. Credit risk is one of management risks being faced frequently and most simply defined as the potential that counterparty will fail to meet its obligations in accordance with agreed terms. Because there are many types of counterparties from individuals to sovereign governments and many different types of obligations from auto loans to derivatives transactions credit risk takes many forms. There have been significant prior researches on credit analysis or credit evaluation. The models and methodologies published so far for credit risk assessment generally fall into two categories: default models and credit scoring models. Default models assess the likelihood of default by an obligor. Credit scoring models are used to assess the credit quality of counterparty.

The application of statistical techniques to credit analysis started in the 1960’s with the development of computers. Beaver[1] is one of the first researchers to introduced discriminant analysis (DA) to study bankruptcy prediction. From the 1980’s, the DA method was replaced by other statistical techniques such as logit analysis, probit analysis, multidimensional scaling and so on[2][3]. Artificial Intelligence (AI) techniques, particularly rule-based expert systems, case-based reasoning (CBR) systems and machine learning techniques such as neural networks have been used to credit rating or bankruptcy analysis. Recently, researchers have proposed the hybrid data mining approach in the design of an effective credit scoring model[4-6].

Credit scoring models vary regarding the type and quantity of the data needed for decision making, most of prior studies are limited to be applied mainly in financial community companies such as commercial banks where the customers are usually required to submit their financial data and/or others. However, many of companies of non-financial community cannot require financial data and/or others from their customers. It is difficult in particular to obtain the customers’ data in the situation where the customers are small businesses without disclosing financial information. Therefore, most of models and methods for credit assessment, which are suitable and effective to commercial banks or large business, cannot always be applied to small business.

To deal with the customers’ credit assessment problem in a small company, we have proposed some new approaches to assess the customers’ credit only based on daily transaction data such as sales, payments by customers, amount of overdue payment, etc[7] [8]. Because the data can be extracted almost automatically from the database of the management information system, these approaches are suitable to be applied to many of organizations where the customers do not disclose their financial data and have an advantage of a low cost to collect data over other ones published so far.

However, some new issues have to be addressed. One of them is that daily transaction data contains usually a large amount of transaction records and so it is necessary to reduce the data size to improve the learning efficiency. Another issue is how to improve the ability for identifying the unsound customers. Because the number of unsound customers is much less than that of healthy ones, it is more important to indentify unsound customers more accurately than improving the overall accuracy. In order to deal with these issues, this paper proposes a credit assessment system using bagging method. The architecture of the system is described by giving the procedure for generating multiple versions of a customer’s credit assessment through multivariate discriminant analysis (MDA) and aggregating these to get a final assessment through a plurality vote. The performance and effectiveness of the proposed system is confirmed by applying it to the real problems of the company.

Table 1 Credit scores

ScoreCustomers

1A healthy customer for which all orders are accepted.

2A customer for which orders are accepted but limited to a given amount.

3A customer for which orders are accepted only in cash sale.

4An unsound customer for which all of orders are rejected.

2 Credit Assessment Problem in a Small Company

This paper considers the credit assessment problem in a small company that the main business is selling school uniforms and accessories at wholesale. There are 20 employees in the company, and the annual sale is about 600 million Japanese yen. Orders come from about 800 customers that are classified into three types: retailer, school and others. The customers’ credit has been assessed through a four-grade credit score as shown in Table 1.

Because most of the customers are minor small businesses without disclosure of financial information, it is almost impossible to obtain their financial data. It is also difficult to frequently ask an agency for evaluating customers’ credit due to limited budget. For these reasons, it is obviously preferable to develop a system that be able to assess the customers’ credit only based on daily transaction data.

Figure 1 Scheme of credit assessment system using bagging

3 Features Data Extraction

Considering the available data that can be obtained from the company, we collected eight basic features and represent customer Ci (i=1,2, ,n) by the following data structure:

Ci: (xi1, xi2, xi3, , xi8)(1)

where n is the number of customers, xij (j=1,2, ,8) are the features of customer Ci defined as the following:

xi1, xi2: 0-1 variables representing the type of customer Ci as shown in Table 2.

Table 2 0-1 Variables xi1 and xi2

Type of customers xi1xi2

retailer00

school01

other10

 xi3: average amount of overdue payment in the year considered.

 xi4: maximum overdue days for all of overdue payment in the year considered.

 xi5: number of times that overdue payment occurs in the year considered.

 xi6: total sales in the year considered.

 xi7: rate of the average amount of overdue payment to the total sales, i.e. xi7 =xi3 / xi6.

 xi8: number of transaction months that any order from the customer is fulfilled in the year considered.

4 Credit Assessment System Using Bagging

4.1 System scheme

Bagging, which stands for bootstrap aggregating, is one of the earliest and perhaps the simplest ensemble based learning algorithms, with a surprisingly good performance [9]. Diversity of classifiers in bagging is obtained by using bootstrapped replicas of the training data. That is, different training data subsets are randomly drawn with replacement from the entire training dataset. Each training data subset is used to train a different classifier of the same type. Individual classifiers are then combined by taking a simple majority vote of their decisions.

Here the credit assessment system using bagging is proposed and showed Figure 1.

4.2 Procedure for credit assessment

When assessing customers’ credit, the number of healthy customers is usually much larger than that of unsound ones. If the entire dataset of all customers is used as the training dataset, the features of healthy customers are often over-learned but the unsound ones are frequently under learned. Practically, it is more important to identify unsound customers than doing healthy ones in order to control credit risk as low as possible. Considering this aspect, a revision is made in the sampling method of the standard bagging algorithm and bagging procedure for credit assessment is proposed as follows.

[Step 1] Data preparation

The entire dataset is S={ (Ci, csi) | i=1,2, ,n }, where Ci is the features data of customers Ci defined in equation (1) and csi (csi∈{1,2,3,4}) is its credit score. Divide S into two sub dataset S1 and S2 as:

S1={ (Ci, csi) | csi =1, i=1,2, ,n } (2)

S2={ (Ci, csi) | csi >1, i=1,2, ,n } (3)

S1 means the subset of healthy customers and S2 represents the subset of unsound customers. The sizes of dataset S1 and S2 are n1 and n2 respectively.

[Step 2] Sampling with replacement (bootstrap)

Let N be the number of bootstrapped replicas, and build the replica Bk (k=1,2,…,N) by drawing randomly p customers and q customers-with replacement-from the dataset S1 and the dataset S2 respectively.

[Step 3] Weak learning

Using replica Bk (k=1,2, ,N) as the training set that Ci be the input variables and csi be the class label, the Classifier Hk can be constructed by multivariate discriminant analysis, which consists of four discriminant functions. The ensemble E can be obtained as E={ Hk | k=1,2, ,N }.

[Step 4] Majority voting

Given a target customer NC, its features data can be collected and denoted as:

NC: (z1, z2, …, zm)(4)

where z1, z2,…, zm have the same definitions as xi1, xi2,…,xim. Then its credit score cf can be decided by the following steps:

[Step 4.1] Let Gj (j=1,2,3,4) be the class label that is corresponding to the credit score of j. Applying the Classifier Hk to the target customer NC, the class label of NC can be given as cfk (cfk∈{G1,G2,G3,G4}).

[Step 4.2] Let νkj=1,if cfk=Gj0, ptherwise and νj=∑Nk=1νkj, the credit score cf is chosen by taking a simple majority vote as:

cf=Argmax {vj, j=1,2,3,4}(5)

5 Experiment and Discussion

To investigate the performance and effectiveness of the proposed system, we apply the proposed system to the real credit assessing problems of the company. The features data of the customers in 2001 financial year and their credit scores given by financial managers of the company are collected as shown in Table 3.

Table 3 Features data of 498 customers

FeaturesMeanStandard deviationNumber of customers

xi3196 171762 532Credit scoreNumber

xi456.3773.301474

xi52.923.0622

xi61 201 3903 706 05732

xi70.220.37420

xi85.564.14Total498

Let p=24, q=24 and N=5, 10, 24, we apply the proposed system to classify the customers of Table 3, and further the classification results are compared with that given by the financial managers of the company. The comparison results are shown in Table 4. The credit scores for the customers with score of 2, 3 and 4 provided by the system are 100%, 50% and 90% respectively in agreement with the judgments of the financial managers of the company, and the hit rate is not sensitive to N (the number of classifier).

The classification result based on all of customers’ data is also obtained as shown in Table 5. Comparing the hit rates of Table 4 and Table 5, it is obvious that the proposed system can classify unsound customers more effectively than ordinary discriminant analysis. But the system gave lower hit rate for classifying the healthy customers.

6 Concluding Remarks

This paper dealt with customers’ credit scoring problems in a small company and intended to assess the customer’s credit based only on daily transaction data such as sales, payments by customers, amount of overdue payment, etc. A credit assessment system using bagging method was proposed and the performance and effectiveness is confirmed by applying it to the real problems of the company. The experiment results showed that the system can classify unsound customers more effectively than ordinary discriminant analysis.

Breiman[10] reported that linear discriminant analysis is of stable classifiers and has low variance, but can have high bias and therefore bagging cannot work well. It can be expected that the performance of the system can be further improved by introducing appropriate unstable classifiers.

Acknowledgement

This work was partly supported by grant-in-aid for scientific research (C) from the japan society for the promotion of science under grant no. 19530324.

References

[1] W Beaver. Financial Ratios as Predictor of Failure[J].Journal of Accounting Research (Supplement), 1966, 4: 71-111.

[2] L H Ederington. Classification Models and Bond Ratings[J]. The Financial Review, 1985, 20(4):237-262.

[3] M C Mar, C G Apellaniz, C S Cinca.A Multivariate Study of Spanish Bond Ratings[J]. Omega, 1996, 24(4): 451-462, 1996.

[4] D West.Neural Network Credit Scoring Models[J]. Computers and Operations Research, 2000,27(11/12): 1131-1152.

[5] B Baesens, et al.Benchmarking State-of-the-art Classification Algorithms for Credit Scoring[J]. Journal of the Operational Research Society, 2003, 54(6): 627-635.

[6] N C Hsieh. Hybrid Mining Approach in the Design of Credit Scoring Models[J]. Expert Systems with Applications, 2005, 28(4): 655-665.

[7] Y DONG. A Case Based Reasoning System for Evaluating Customer Credit[J]. Journal of Japan Industrial Management Association, 2006, 57(2): 144-152 (In Japanese language).

[8] Y DONG. Development of a Customer Credit Evaluation System via Case-based Reasoning Approach[J]. Asia-Pacific Journal of Industrial Management, 2008, 1(1): 1-7.

[9] L Breiman. Bagging predictors[J]. Machine Learning, 1996, 24(2): 123-140.

[10] L Breiman. Bias, Variance, and Arcing Classifiers[R]. Technical Report 460, Statistics Department, University of California, 1996: 1-25.

主站蜘蛛池模板: 成年网址网站在线观看| 久久九九热视频| 国产又色又爽又黄| 欧美国产菊爆免费观看| 日本高清在线看免费观看| a网站在线观看| 日韩高清在线观看不卡一区二区| 欧美国产在线精品17p| 视频在线观看一区二区| 亚洲欧洲日韩国产综合在线二区| 伦精品一区二区三区视频| 亚洲精品成人7777在线观看| 欧美国产日韩另类| 91精品国产麻豆国产自产在线| 免费看美女毛片| 福利在线一区| 国产成年无码AⅤ片在线| 一级毛片a女人刺激视频免费| 伊在人亚洲香蕉精品播放| 久久香蕉国产线看观看式| 欧美亚洲日韩不卡在线在线观看| 五月天天天色| 五月天香蕉视频国产亚| 欧美日韩激情| 久久黄色一级视频| 国产第一色| 黄色不卡视频| 欧美特级AAAAAA视频免费观看| 欧美一级专区免费大片| 日本不卡在线| 2021国产在线视频| 国产精品30p| 日韩毛片基地| 国产精品自在线天天看片| 在线观看国产小视频| 天天做天天爱夜夜爽毛片毛片| 久久国产热| 国产夜色视频| 色香蕉网站| 亚洲二区视频| 国产精品久久自在自线观看| 午夜电影在线观看国产1区| 国产黄色片在线看| 久久a级片| 99精品国产高清一区二区| 波多野结衣第一页| 美女被操黄色视频网站| 中文毛片无遮挡播放免费| 日韩少妇激情一区二区| 国产你懂得| 好紧太爽了视频免费无码| 草草线在成年免费视频2| 58av国产精品| 高清视频一区| 欧美激情成人网| 欧美一区二区三区香蕉视| 91亚洲精品国产自在现线| 91精品伊人久久大香线蕉| 色综合天天综合中文网| 98精品全国免费观看视频| 精品福利网| AV在线麻免费观看网站| 国产一二三区在线| 国产精品观看视频免费完整版| 国产91九色在线播放| 色综合天天视频在线观看| 亚洲毛片在线看| 欧美一区福利| 国产精品性| 国产欧美日韩综合一区在线播放| 国产91无码福利在线| 四虎影视无码永久免费观看| 91精品在线视频观看| 国产成人亚洲精品色欲AV| 婷婷亚洲视频| 91日本在线观看亚洲精品| 亚洲免费福利视频| 99色亚洲国产精品11p| 色综合婷婷| 久久综合伊人 六十路| 91精品最新国内在线播放| 69av在线|