Home Mathematics A study on predicting crime rates through machine learning and data mining using text
Article Open Access

A study on predicting crime rates through machine learning and data mining using text

  • Ruaa Mohammed Saeed EMAIL logo and Husam Ali Abdulmohsin
Published/Copyright: March 31, 2023
Become an author with De Gruyter Brill

Abstract

Crime is a threat to any nation’s security administration and jurisdiction. Therefore, crime analysis becomes increasingly important because it assigns the time and place based on the collected spatial and temporal data. However, old techniques, such as paperwork, investigative judges, and statistical analysis, are not efficient enough to predict the accurate time and location where the crime had taken place. But when machine learning and data mining methods were deployed in crime analysis, crime analysis and predication accuracy increased dramatically. In this study, various types of criminal analysis and prediction using several machine learning and data mining techniques, based on the percentage of an accuracy measure of the previous work, are surveyed and introduced, with the aim of producing a concise review of using these algorithms in crime prediction. It is expected that this review study will be helpful for presenting such techniques to crime researchers in addition to supporting future research to develop these techniques for crime analysis by presenting some crime definition, prediction systems challenges and classifications with a comparative study. It was proved though literature, that supervised learning approaches were used in more studies for crime prediction than other approaches, and Logistic Regression is the most powerful method in predicting crime.

1 Introduction

Violations of the law pose a danger to the administration of justice and should be curtailed. Computational crime prediction and forecasting can help improve the safety of metropolitan areas. The inability of humans to process large amounts of complicated data from big data makes it difficult to make early and accurate predictions about criminal activity. Computational problems and opportunities arise from accurately predicting crime rates, types, and hot locations based on historical patterns. Still, there is a need for stronger prediction algorithms that target police patrols toward criminal events, despite extensive research efforts [1].

Crime analysis is a methodology approach used to identify crime spots and it is not an easy approach. In year 2020, Geographical Information Systems (GIS) was the non-machine learning tool used earlier for temporal and spatial data. GIS used the crime spots technique that mainly depends on crime type to help reduce crime rates [2].

Crime rate prediction can be defined as a method to build a system for finding crime future patterns and help the law enforcer to solve the crime which lead to reduce its rate in the real-world. Meanwhile, crime forecasting refers to the ability to predict far future crimes, up to years in the future to increase crime preventions, and this can be achieved by using time series approaches to find future crime trends from time series data.

In general, crime analysis in data mining can be predicted using different methods such as statistical methods [3,4,5], cover visualization methods [6,7,8], unsupervised learning, and supervised learning techniques [9,10,11]. Visualization methods include visual explanation of the connection between geographical view and other crime data such as geographic profiling [12], GIS-based crime mapping [13], crime prediction [14,15,16], and asymmetric mapping [17]. However, to obtain the connection between statistical methods, unsupervised learning techniques and crime data such as clustering methods which were very popular. These techniques were implemented as follows: clustering methods were used as criminal behavior analysis [18,19], crime pattern recognition, criminal association analysis, and incident pattern recognition to extract the groups or patterns that had the same features in crime data [20].

Then, the machine learning algorithms’ development helped the crime data analysis researchers to investigate crime depending on preprocessing and clustering techniques to extract the crime locations from row data [21], using the supervised and unsupervised machine learning models to analyze these data and discover their pattern based on time and location of crime to produce precise predictions [22]. In addition, the machine learning algorithms’ development also helped to investigate the reasons of crime occurring in certain areas by applying machine learning algorithms on history data collected from past years in the same area [23].

Nowadays, the development of classification algorithms, especially machine learning algorithms, helps to enhance the crime prediction [24]. Therefore, researchers tried to connect crime with time depending on various factors to help in resolving the crimes and prevent it and its frequencies. In year 2018, Fourier series was proposed as an analytic technique to accomplish a flexible mathematical model on time periodic effects. This technique explained the accuracy and usefulness of analytical techniques to connect the time factor with crime prediction. Thereby, the analytical techniques effectively achieved the relation between crimes and time, but not for all type of crimes [25].

We can say that, machine learning algorithms is widely used in crime prediction discipline, but it is not more than data mining and each one has its own performance and gives a perfect result.

Our work has been setup so that interested parties become familiar with the previous studies and the accuracy that have been achieved, presented in tabular format. The main contribution of this study is presenting machine learning and data mining applications in predicting crimes, by classifying the studies according to different types of techniques, and providing a brief overview of each applicable methodology that has been used to mine crime, and also, enlisting some challenges faced by such system developers.

The limitation of the state of art works are the lack with big entered geo-area, no generality because of using the same system on two different crime datasets leads to different accuracy percentages with big difference, the lack of works that predict criminal action, and finally but not last, the difficulty that faced the researchers in the crime prediction field, that it may be a missing informations in the on-line crime datasets or the data are repeated.

The rest of the review paper is organized as follows: in Section 2, the research methodology of the survey is explained, in Section 3, crime definitions and descriptions are discussed in detail. In Section 4, challenges of prediction system are discussed. In Section 5, the public datasets are described. The related work is included in Section 6. In Section 7, the prediction system classification is introduced. A comparison study of previous works is explained in Sections 8.1 and 8.2. Eventually, discussion and conclusion are presented at the end of this article in Section 9.

2 Research methodology

The methodology involved in this review study contains two stages: first is getting the relevant research works on crime prediction with machine learning and data mining studies and analyze them, and second is setting a classification table in Section 8, and finally presenting a study about the performances of various algorithms and the achieved accuracy and comparison between them.

In choosing relevant research works, any Master and Doctoral dissertations or any papers that were not published were ignored. The research keyword was crime prediction with machine learning and data mining or violent crime prediction, the publishing criteria was between 2001 and 2022, the abstract of every article was read and then determined if it is relevant or not.

3 Crime definition and description

Generally, crimes are classified in to three groups: infractions, felonies, and misdemeanors based on the severity, punishment, and seriousness of crimes. Infractions are minimal crimes such as tailgating, parking overtime, and speeding. Meanwhile, Felonies are considered as most severe crimes followed by misdemeanors which are considered less severe crimes [26]. In addition, the crimes are classified into types based on the time when occurred such as the day, week, month, and season in order to find the connection between these types of crime and then to predict them in the future using machine learning and data mining algorithms. This can be done by using a dataset collected on a certain area for earlier crimes to forecast the future ones.

There are many types of crimes depending on the severity of the crime. Therefore, crimes are classified into three types, which are, felony, misdemeanors, and infractions (or wobblers) [22], as listed and defined in Table 1.

Table 1

Crime description

Crime type Crimes Description
Felony Murder/homicide The non-negligent/willful killing of a person by another. This includes deaths caused by suicide, negligence, accident, assaults to murder, and justifiable homicides (which are scored as aggravated assaults).
Burglary The illegal entry into a structure to commit theft or a felony. Also, includes attempted forcible entry.
Forcible rape Its forcing a female regardless of her age to carnal assault that happens forcibly and against her will. This includes assaults to rape and rape by force.
Illegal drug selling This includes drug trafficking and drug distribution which is selling, transporting, and distributing drugs. It is considered as a federal crime by law; a felony crime that involves serious penalties.
Robbery The attempt to take anything of value from the custody, control, or care of a person by forcing or threatening by force by putting a victim in fear.
Aggravated assault, battery An illegal attack by a person upon another by using a weapon or the victim suffers aggravated bodily injury or obvious severe injuries.
Arson Any malicious burning or willful or attempt to burn, with/without intent to defraud a motor vehicle, dwelling house, public building, aircraft, or a personal property of another.
Forgery The copying, imitating of something, altering, without authority or right, with the intent to defraud or deceive by passing the thing altered as genuine or original for buying or selling with the intent to defraud or deceive.
Misdemeanor Larceny-theft It means the illegal carrying, leading, riding away, or taking of property from the possession or constructive possession of another. Examples are motor vehicle parts, thefts of bicycles shoplifting, and pocket-picking.
Fraud The willful deviation of the truth for the sake of persuading another person or other entity in dependence upon it to part with or to surrender a legal right or something of value.
Embezzlement It means the illegal misapplication or misappropriation by attacker to his/her own purpose the property, money, or control some other thing of value entrusted to his/her care and custody.
Stolen property It means receiving, selling, buying, concealing, possessing, or transporting any property with the knowledge that it has been illegally taken, as by fraud, larceny, robbery, burglary, or embezzlement. Also, attempts are implied.
Vandalism It means that to maliciously destroy or willfully, disfigure, deface, or injure any private or public property, personal or real, without the consent of the person or owner having control or custody by tearing, marking, painting, cutting, breaking, drawing, covering with filth, or any other thing. Also, attempts implied.
Gambling It means that to illegally wager or bet money on something else of value; promote, assist, or operate some stake; wagering information or transmit; purchase, manufacture, sell, transport, gambling equipment, devices, or goods.
Drunkenness It means that to drink alcohol to the edge that one’s mental functionalities, faculties, and physical coordination are substantially impaired.
Infraction and wobblers Overtime parking Parking in an area for longer than the posted time limit.
Speeding ticket It means a piece of paper that a policeman writes to a person who was driving too fast and it indicates that the driver should pay a fine.
Tailgating It means dangerous and illegal habit of driving so close to a vehicle in front. If the driver of the vehicle in front stepped the brakes suddenly, the tailgating driver has the risk of potential and unavoidable danger collision.
Weapons violation It means possessing, carrying, etc., or the violation of ordinances or laws prohibiting the sale, concealment, transportation, possession, purchase, manufacture, or use of cutting instruments, incendiary, explosives, firearms, and devices.

In addition, a crime could be categorized in other categories, such as victim, victimless, and violent crimes and there are other categorizations for crime, but through this study, only the classification mentioned in Table 1 will be considered.

4 Prediction systems challenges

Researchers and governmental security agents face some problems when it comes to predict crime’s location, time, and problems in choosing the effective method to do so. In addition, there are problems faced by the computer science researchers who used machine learning, data mining, and spatial–temporal data. In 2012 and 2016, the near-repeat-victimization and repeat-victimization methods were implemented to predict crimes in houses, streets, and regions. These methods state that if a crime happened in a block, then there is a probability that other crimes are increasing significantly in the same area [27,28].

Crime prediction system developer’s challenges:

  1. The huge amount of data requires a large amount of storage

  2. Crime-related data are usually in different formats such as text, images, graph, audio, relational data, unstructured data, and semi-structured data [29], so, the process of transforming these data to the understandable format is also a challenge.

  3. In machine learning, to give the correct label (e.g., prediction or output) to an instance (e.g., context or input) is a challenge.

  4. Use of appropriate data mining algorithm that gives better results than the used algorithms.

  5. The environment and surrounding factors, such as the lack of the law and the weather, have an impact on the likelihood of crime, which ultimately causes the crime prediction algorithms to make grave errors. Any crime forecast must take the surrounding and environmental changes into consideration to avoid making such errors and to achieve high prediction accuracy.

5 Crime datasets

Crime-related data are gathered from a variety of different sources, including police reports, social media, news, and criminal records. It is difficult to gather data of this amount [30]. The datasets are available online in many countries around the world or gathered from the police departments. During our survey, we noticed that the Chicago crime dataset is more frequently used in crime prediction systems, and that returned to the large population and hight crime rates in this area [31].

6 Related work

With the huge data size nowadays, the evaluation of machine learning and data mining techniques allow us to deal with this row data and extract the results in better ways. Techniques for criminal activity detection and, more generally, machine learning and data mining, have recently been applied to the area of policing to achieve crime reduction.

Correct choices of the parameters for these techniques can help law enforcers to analyze and find the likelihood between crimes as well as patterns and trends in criminal activities, which lead to qualify those activities more efficiently [5].

In this section, the previous related works are discussed and analyzed, these research works are widely variate, some of these take the field of crime analysis to predict, some take the field of application of Artificial Intelligent on crime data, machine learning or data mining (which are subfields of Artificial Intelligent) in order to predict and forecast violence crimes, based on spatial and temporal data in some research works.

During our survey, we noticed five surveys or overviews related to crime prediction and machine learning or data mining.

The earliest was in 2011, a survey of different methods that used to extract patterns from spatial information (they called it spatial data mining (SDM) algorithms) like co-location mining, spatial clustering, spatial hot spots, spatial outliers, spatial auto-regression, conditional auto-regression, and geographically weighted regression, which conclude the effectiveness of these SDM algorithms and the guarantee to use it in the real world, and they found the need for more methods to validate the hypotheses produced by these algorithms [32].

In 2015, some researches in the field of crime prediction with data mining and machine learning were discussed , this research takes a variety of crime related variables then found that the information influencing the crime rate such as age, alcohol, hot spots, media, some policies, etc., do not have effect on crime rate prediction [33], it succeeded in discussion, but there is a shortage in the conclusion.

In 2016, another survey was published. It reviewed over 100 applications of data mining in crime. They made a concise review by preparing a brief table containing the used technique with a specific software, the relevant study area with the expected use and function. They suggested to enhance the benefits, improvements, and usability of data mining techniques in crime data mining by introducing more training and educating fields for these techniques [34].

In 2019, a systematic review of crime prediction and data mining studies between 2004 and 2018 classified the research works based on the used data mining techniques. Based on the challenges addressed and the number of research papers according to technique used, by covering 40 papers, a gap was identified in all of them, that is, when datasets increase, there is a noticeable decrease in the system’s overall performance [35].

Finally, in 2020, another systematic review was done, 32 papers were analyzed from 2000 to 2018 in spatial crime forecasting. In this study, in addition to the surveying table that contains the information about the space and time of the research, the crime data, and forecasting details, more than one summary was given, that is, the top four proposed methods, best proposed, and baseline methods applied in the 32 selected papers. This study discussed the points of strengths, weaknesses, threats, and opportunities of the selected papers, and the conclusion was that the contiguity of algorithms should not be ignored in the future [2].

7 Classification of prediction systems

In machine learning, classification of any prediction systems is the way of defining a model that can describe the concepts or classes of information. The purpose of this model is to predict the class of objects that has an unknown class label. In the real world, police departments are not able to control and limit this large number of crime activities alone. Meanwhile, crimes are rising rapidly, therefore there is a need to use data mining with police detection efforts to predict and then reduce these crime cases. As mentioned, there is a need for technology, especially computer science technologies, using which this problem can be solved as fast as possible. Eventually, prediction systems can be classified according to many factors [36]:

  1. According to approaches, machine learning and data mining.

  2. According to prediction type, special and temporal.

  3. According to dataset, image prediction and data prediction.

8 Comparison study: Crime prediction vs classification approaches

In this section, Tables 2 and 3 lists the literature surveys of the machine learning and data mining algorithms using different datasets for different cities around the world. In addition, a comparison is made between machine learning and data mining methods toward crime in a border crime prediction system. In these tables, we enlist each selected paper with the important information that will assist other researchers in determining which categories of crime prediction techniques are most powerful. Consequently, these two tables explain the machine learning and data mining algorithms with crime prediction in order to achieve the purpose of this survey. The tables contain the references, the machine learning or data mining algorithms, the used dataset source, and the accuracy of each algorithm depending on a certain dataset that was used for a particular city. The following section discusses crime prediction research works that followed the machine learning and data mining approaches, separately.

Table 2

Literature survey on crime prediction research works with machine learning

Ref. Year Method Dataset Classification technique Acc. %
[49] 2014 Machine learning London mobile and crime data RF 70
[50] 2017 Machine learning Portland data RNN 75
LSTM 81
GRU 81
[51] 2017 Machine learning UCI machine learning repository website DT (J48) 94.25
[31] 2017 Machine learning Chicago, Illinois (image dataset) SVM 67.01
KDE 66.33
DNN 84.25
[52] 2018 Machine learning Chicago crime data DT 38
RF 59
Neural network 81
[53] 2018 Machine learning Vancouver police department KNN 39.9
Boosted decision 43.2
[47] 2018 Machine learning Chicago crime data RNN 74.1
Portland crime data 63.8
Chicago crime data CNN 72.7
Portland crime data 62.9
[46] 2018 Machine learning Taoyuan/Taiwan DNN 83
KNN 87
SVM 88
[54] 2018 Machine learning Chicago crime data KNN 78
NB 64
DT 78
[1] 2021 Machine learning Chicago crime data LR 90
DT 66
RF 77
MLP 87
NB 73
SVM 66
XGBoost 94
KNN 88
Los Angeles crime data LR 48
DT 60
RF 43
MLP 84
NB 71
SVM 60
XGBoost 88
KNN 89
[55] 2021 Machine learning Bangladesh crime data LR 73.6
[48] 2021 Machine learning Baltimore city Logistic regression 95
Neural network 94
[56] 2021 Machine learning New York crime data SVM 43
RF 50
XGBoost 52
Table 3

Literature survey on crime prediction research works with data mining

Ref. Year Method Dataset Classification technique Acc. %
[62] 2013 Data mining Different states of USA DT 83.9
[61] 2014 Data mining India crime data WEKA on two K-mean clusters 93.62 for C1
93.99 for C2
[63] 2015 Data mining Denver crime data NB 51
DT 42
Los Angeles crime data NB 54
DT 43
[64] 2017 Data mining Chicago crime data DT 75.9
RF 83.39
NB 77.64
[55] 2021 Data mining Bangladesh crime data NB 69.5
KNN 76.9
[65] 2021 Data mining Indian and Bangalore crimes data KDE 77.49

8.1 Machine learning and crime prediction

Crime prediction has been studied widely due to its relation with the society, these studies employ machine learning algorithms to outfit the crime predicting and forecasting issues. Machine learning algorithms are successfully used to predict spatial crime information. So, in 2006, Support Vector Machine (SVM) algorithm was applied to predict the location of crimes in Columbus, Ohio, US. SVM used both random and clustering approaches to train and test dataset and then predict the hot spot area and improve its effectiveness [37]. These algorithms are used to study the correlation between crime occurrence and crime motivates. In 2013, a Logistic Regression (LR) algorithm was implemented to forecast the relationship between burglar crimes and several other factors which are time of the day, day of the week, barriers, connectors, and repeated victimization, but this model was a failure for large geo-area [38].

In 2015, crime was predicted in southern US states using Random Forest (RF) method after applying SmoteR algorithm to detect the more dangerous crimes. In addition, their work was optimized using R software after the density and population were selected as real values [39].

Eventually, the auto-regressive approach was implemented to forecast the number of crimes that happened in the same time and predict them in urban areas [40]. In 2017, Naive Bayes (NB) algorithm was proposed to predict crime incident depending on history data that shows the same crime happening in the same place. Moreover, NB model was compared with Decision Tree (DT) algorithm in order to test the performance of the proposed method, and found that the NB outperforms the DT even with the computational complexity of DT [41].

In 2020, many research works were presented, one of them fused three methods, the Long short-term memory (LSTM), Residual neural network, and Graph convolutional network to propose a certain mechanism, which was able to extract spatial–temporal features to predict crimes in Chicago. In addition, Root mean square error and Mean absolute error were used as a criterion to test the performance of the applied method [42]. On the other study, a crime network for spatiotemporal data was proposed using Convolutional neural network (CNN) in order to automatically predict the time and place of the crimes [43]. And in another study [44], Recurrent neural network (RNN) with LSTM was integrated in order to design time series crime prediction system to predict crimes in Addis Ababa. Also, in one more study [45], the severity level of crime in Boston was studied and predicted using machine learning algorithm such as SVM, NB, LR, and DT.

We can say that machine learning field successfully analyzed how crime behavior develops over time. Many conclusions were reached from the survey shown in Table 2, such as:

  1. According to ref. [31], the Deep neural network (DNN) has overcome the SVM, but according to ref. [46], the opposite occurred, the SVM has overcome the DNN, and this can be justified by one reason, the first has worked on an image dataset and the second has worked on a text dataset. So, it is recommended to use DNN in case of an image crime dataset.

  2. According to refs [1,47], using the same system on two different crime datasets leads to different accuracy percentages with big difference, which shows that the dataset utilized severely affects the results gained. Therefore, this presents a challenge to these algorithms to prove its efficiency and then its accuracy to predict a crime.

  3. After surveying the machine learning approaches, the highest accuracy crime prediction results gained are shown in Table 2.

  4. According to ref. [48], the LR algorithm achieves the highest accuracy among the different machine learning algorithms.

  5. When observing the crime prediction results of the research works adopting the RF method, it was noticed that the highest accuracy achieved is 59.8%, which is considered a poor accuracy, compared with other methods.

  6. The standard deviations of crime prediction accuracies for each algorithm show that the SVM algorithm outperforms the LR algorithm and achieves (71.9%) accuracy. Actually, it outperforms all other machine learning algorithm’s standard deviation results.

According to the previous studies, it was noticed that the highest crime prediction accuracy results were gained through the machine learning logistic regression method, which was 95% for Baltimore city in ref. [48]. Furthermore, algorithms such as XGBoost and Logistic Regression have achieved a high accuracy of 94 and 90%, respectively [1]. However, it can be noticed that the same algorithm can perform differently with two different datasets, and this proves that the dataset has a large influence on the crime prediction results.

8.2 Data mining and crime prediction

In 2011, special data mining and technologies were proposed to extract patterns from spatial and temporal data. In addition, the data were mined geospatially using special knowledge. In 2011, crimes were predicted in Portland; data mining methods were used to forecast crimes using spatial and temporal dataset collected in Portland and predict whether residential burglary will happen. The methods NB, SVM, DT, and K-Nearest Neighbor algorithms were applied to predict crimes and the result was compared between these methods, which shows the power of neural network in complex systems [57]. Moreover, the pattern extraction usefulness was limited by the complexity of the relationships between spatial data [32]. In 2016, high accuracy was achieved using various DT algorithms to extract knowledge from data collected during 1994 instances, with 128 attributes, then made a comparison between them. In addition, the data were trained and tested using scatter plots to illustrate the crime areas with the severity of each area based on previous data [58]. In year 2016, data mining algorithms were developed and used to classify these crimes based on their types. A crime was characterized according to time, based on factors such as vacations that started with the academic year for colleges and schools. In addition, the classifier was used to predict the severity risk of the crime areas in Denver city between 2010 and 2015 [59]. In 2020, Autoregressive Integrated Moving Average (ARIMA) technique was implemented to predict time series data and then have been visualized with data mining platform. This technique proved that regressive model can work on historical newsfeed data to predict future crimes [60].

Many conclusions were reached from this survey, such as:

  1. Table 3 shows the comparison of many algorithms implemented against crime prediction challenge, such as DT, NB, RF, etc., either individually or group of them to a certain type of dataset and city. Thereby, this presents a challenge to these algorithms to confirm its effectiveness and then its accuracy to predict a crime.

  2. The highest accuracy crime prediction results gained, based on the survey of the data mining methods, are shown in Table 3.

  3. According to ref. [61] the K-mean algorithm achieves the highest accuracy among the different data mining algorithms.

  4. When we take the standard deviations of crime prediction accuracies for each algorithm, we noticed that the DT algorithm outperforms the NB algorithm and achieves (18.9%) accuracy.

According to the previous study, DT and Neural Network have recorded 94% accuracy for different datasets in refs [48,51] for machine learning algorithms. The k-mean data mining algorithm achieved 93.62% (cluster one) and 93.99% (cluster two) for crimes in India [61].

9 Conclusion

Crime prediction became the hot research area nowadays because of its correlation benefits to any society or nation’s security. It is found that many studies adopted supervised learning approaches to the field of crime prediction compared to others.

It is obviously concluded, that data mining methods achieved the highest crime prediction accuracies, overcoming machine learning methods. Regardless of this, on average, the machine learning out performs data mining in crime prediction. But, when we use the standard deviation of crime prediction accuracies of machine learning and data mining, we can say that the machine learning algorithms perform better than the data mining algorithms.

Eventually, it can be concluded that the comparison of machine learning and data mining algorithms for crime prediction systems give certain indications, such as the selection of an algorithm may depend on the dataset type (like image, text, video, or voice dataset), and there are certain algorithms that preform perfectly on average, but can fail working with other datasets. Crime prediction methods adopting deep learning algorithms were not covered through this survey for time limitation reasons.

Acknowledgements

The authors, want to thank all researchers whose works have been cited in this survey, in the field of crime prediction.

  1. Funding information: This project is funded by the authors only.

  2. Conflict of interest: The authors declare that there is no conflict of interest regarding the publication of this article.

References

[1] Safat W, Asghar S, Gillani SA. Empirical analysis for crime prediction and forecasting using machine learning and deep learning techniques. IEEE Access J. 2021;9:70080–94.10.1109/ACCESS.2021.3078117Search in Google Scholar

[2] Kounadi O, Ristea A, Araujo A, Leitner M. A systematic review on spatial crime forecasting. Crime Sci. 2020;9(1):1–22.10.1186/s40163-020-00116-7Search in Google Scholar PubMed PubMed Central

[3] Tollenaar N, van der Heijden PGM. Which method predicts recidivism best?: a comparison of statistical, machine learning and data mining predictive models. J R Stat Soc Ser A. 2013;176(2):565–84.10.1111/j.1467-985X.2012.01056.xSearch in Google Scholar

[4] Enzmann D, Podana Z. Official crime statistics and survey data: Comparing trends of youth violence between 2000 and 2006 in cities of the Czech Republic, Germany, Poland, Russia, and Slovenia. Eur J Crim Policy Res. 2010;16(3):191–205.10.1007/s10610-010-9121-zSearch in Google Scholar

[5] Holst A, Bjurling B. A Bayesian parametric statistical anomaly detection method for finding trends and patterns in criminal behavior. In 2013 European Intelligence and Security Informatics Conference. IEEE; 2013.10.1109/EISIC.2013.19Search in Google Scholar

[6] Brunsdon C, Corcoran J, Higgs G. Visualising space and time in crime patterns: A comparison of methods. Comput Environ Urban Syst. 2007;31(1):52–75.10.1016/j.compenvurbsys.2005.07.009Search in Google Scholar

[7] Vural MS, Gök M, Yetgin Z. Generating incident-level artificial data using GIS based crime simulation. In 2013 International Conference on Electronics, Computer and Computation (ICECCO). IEEE; 2013.10.1109/ICECCO.2013.6718273Search in Google Scholar

[8] Xiang Y, Chau M, Atabakhsh H, Chen H. Visualizing criminal relationships: Comparison of a hyperbolic tree and a hierarchical list. Decis Support Syst. 2005;41(1):69–83.10.1016/j.dss.2004.02.006Search in Google Scholar

[9] Jain LC, Seera M, Lim CP, Balasubramaniam P. A review of online learning in supervised neural networks. Neural Comput Appl. 2014;25(3):491–509.10.1007/s00521-013-1534-4Search in Google Scholar

[10] EL Aissaoui O, EL Madani EY, Oughdir L, EL Allioui Y. Combining supervised and unsupervised machine learning algorithms to predict the learners’ learning styles. Procedia Comput Sci. 2019;148:87–96.10.1016/j.procs.2019.01.012Search in Google Scholar

[11] Mackenzie DM. CDUL: class directed unsupervised learning. Neural Comput Appl. 1995;3(1):2–16.10.1007/BF01414172Search in Google Scholar

[12] Rossmo DK, Laverty I, Moore B. Geographic profiling for serial crime investigation, in Geographic information systems and crime analysis. IGI Glob. 2005;6:102–17.10.4018/978-1-59140-453-8.ch006Search in Google Scholar

[13] Ristea A, Leitner M. Urban crime mapping and analysis using GIS. ISPRS Int J Geo-Information. 2020;9(9):511.10.3390/ijgi9090511Search in Google Scholar

[14] Corcoran JJ, Wilson ID, Ware JA. Predicting the geo-temporal variations of crime and disorder. Int J Forecast. 2003;19(4):623–34.10.1016/S0169-2070(03)00095-5Search in Google Scholar

[15] Sangani A, Sampat C, Pinjarkar V. Crime prediction and analysis. In 2nd International Conference on Advances in Science & Technology (ICAST); 2019.10.2139/ssrn.3367712Search in Google Scholar

[16] Wang Y, Peng X, Bian J. Computer crime forensics based on improved decision tree algorithm. J Netw. 2014;9(4):1005.10.4304/jnw.9.4.1005-1011Search in Google Scholar

[17] Khan M, Ali A, Alharbi Y. Predicting and preventing crime: A crime prediction model using San Francisco crime data by classification techniques. New Jersey: Wiley/Hindawi. Vol. 2022, No. 4830411, 2022. p. 13.10.1155/2022/4830411Search in Google Scholar

[18] Ewart BW, Oatley GC. Applying the concept of revictimization: using burglars’ behaviour to predict houses at risk of future victimization. Int J Police Sci Manag. 2003;5(2):69–84.10.1350/ijps.5.2.69.14324Search in Google Scholar

[19] Box GEP, Jenkins GM, Reinsel GC, Ljung GM. Time series analysis: forecasting and control. John Wiley & Sons; 2015.Search in Google Scholar

[20] Jangra M, Kalsi S. Naïve Bayes approach for the crime prediction in Data Mining. Int J Comput Appl. 2019;178(4):33–7.10.5120/ijca2019918907Search in Google Scholar

[21] Khairuddin A, Alwee R, Haron H. A comparative analysis of artificial intelligence techniques in forecasting violent crime rate. In IOP Conference Series: Materials Science and Engineering. IOP Publishing; 2020.Search in Google Scholar

[22] Sardana D, Marwaha S, Bhatnagar R. Supervised and unsupervised machine learning methodologies for crime pattern analysis. Int J Artif Intell Appl. 2021;12(1):43–58.10.5121/ijaia.2021.12106Search in Google Scholar

[23] Sivanagaleela B, Rajesh S. Crime analysis and prediction using fuzzy c-means algorithm. In 3rd International Conference on Trends in Electronics and Informatics (ICOEI). IEEE; 2019.10.1109/ICOEI.2019.8862691Search in Google Scholar

[24] Liu X, Sun H, Han S, Han S, Niu S, Qin W, et al. A data mining research on office building energy pattern based on time-series energy consumption data. Energy Build. 2022;259:111888.10.1016/j.enbuild.2022.111888Search in Google Scholar

[25] Borowik G, Wawrzyniak ZM, Cichosz P. Time series analysis for crime forecasting. In 2018 26th International Conference on Systems Engineering (ICSEng). IEEE; 2018.10.1109/ICSENG.2018.8638179Search in Google Scholar

[26] Goel A, Singh B. White collar crimes: A study in the context of classification, causation and preventive measures. Contemp Soc Sci. 2018;27:84–92.10.29070/27/58311Search in Google Scholar

[27] Grove L, Farrell G. Once bitten, twice shy: Repeat victimization and its prevention. Oxf Handb Crime Prev. 2012;404–19.10.1093/oxfordhb/9780195398823.013.0020Search in Google Scholar

[28] Chainey SP, da Silva BFA. Examining the extent of repeat and near repeat victimisation of domestic burglaries in Belo Horizonte, Brazil. Crime Sci. 2016;5(1):1–10.10.1186/s40163-016-0049-6Search in Google Scholar

[29] Peter P, Ickjai L. Crime analysis through spatial areal aggregated density patterns. Geoinformatica. 2011;15(1):49–74.10.1007/s10707-010-0116-1Search in Google Scholar

[30] Jonas p, Paul E, Stijn V, Marc MVH, Guido D. Gaining insight in domestic violence with emergent self organizing maps. Expert Syst Appl. 2009;36(9):11864–74.10.1016/j.eswa.2009.04.027Search in Google Scholar

[31] Kang HW, Kang HB. Prediction of crime occurrence from multi-modal data using deep learning. PLoS One. 2017;12(4):e0176244.10.1371/journal.pone.0176244Search in Google Scholar PubMed PubMed Central

[32] Shekhar S, Evans MR, Kang JM, Mohan P. Identifying patterns in spatial information: A survey of methods. Wiley Interdiscip Reviews Data Min Knowl Discovery. 2011;1(3):193–214.10.1002/widm.25Search in Google Scholar

[33] Mookiah L, Eberle W, Siraj A. Survey of crime analysis and prediction. In The Twenty-Eighth International Flairs Conference; 2015.Search in Google Scholar

[34] Hassani H, Huang X, Silva ES, Ghodsi M. A review of data mining applications in crime. Stat Anal Data Mining: ASA Data Sci J. 2016;9(3):139–54.10.1002/sam.11312Search in Google Scholar

[35] Falade A, Azeta A, Oni A, Odun-ayo I. Systematic literature review of crime prediction and data mining. Rev Comput Eng Stud. 2019;6(3):56–63.10.18280/rces.060302Search in Google Scholar

[36] Okeke OC. An overview of crime analysis, prevention and predicton using data mining based on real time and location data. Int J Recent Technol Eng. 2022;5(10):99–103.10.33564/IJEAST.2021.v05i10.015Search in Google Scholar

[37] Kianmehr K, Alhajj R. Crime hot-spots prediction using support vector machine. In IEEE International Conference on Computer Systems and Applications. IEEE Computer Society; 2006.10.1109/AICCSA.2006.205203Search in Google Scholar

[38] Antolos D, Liu D, Ludu A, Vincenzi D. Burglary crime analysis using logistic regression. In International Conference on Human Interface and the Management of Information. Berlin: Springer; 2013.10.1007/978-3-642-39226-9_60Search in Google Scholar

[39] Cavadas B, Branco P, Pereira S. Crime prediction using regression and resources optimization. In Portuguese Conference on Artificial Intelligence. Springer; 2015.10.1007/978-3-319-23485-4_51Search in Google Scholar

[40] Cesario E, Catlett C, Talia D. Forecasting crimes using autoregressive models. in 2016 IEEE 14th Intl Conf on Dependable, Autonomic and Secure Computing, 14th Intl Conf on Pervasive Intelligence and Computing, 2nd Intl Conf on Big Data Intelligence and Computing and Cyber Science and Technology Congress (DASC/PiCom/DataCom/CyberSciTech). IEEE; 2016.10.1109/DASC-PICom-DataCom-CyberSciTec.2016.138Search in Google Scholar

[41] Vural MS, Gök M. Criminal prediction using Naive Bayes theory. Neural Comput Appl. 2017;28(9):2581–92.10.1007/s00521-016-2205-zSearch in Google Scholar

[42] Hou M, Hu X, Cai J, Han X, Yuan S. An integrated graph model for spatial–temporal urban crime prediction based on attention mechanism. ISPRS Int J Geo-Information. 2022;11(5):294.10.3390/ijgi11050294Search in Google Scholar

[43] Ilhan F, Tekin SF, Aksoy B. Spatio-temporal crime prediction with temporally hierarchical convolutional neural networks. In 2020 28th Signal Processing and Communications Applications Conference (SIU). IEEE; 2020.10.1109/SIU49456.2020.9302169Search in Google Scholar

[44] Meskela TE, Afework YK, Ayele NA, Teferi MW, Mengist TB. Designing time series crime prediction model using long short-term memory recurrent neural network. Int J Recent Technol Eng. 2020;9:402–5.10.35940/ijrte.D5025.119420Search in Google Scholar

[45] Hussain FS, Aljuboori AF. A crime data analysis of prediction based on classification approaches. Baghdad Sci J. 2022;4:1073–7.Search in Google Scholar

[46] Lin YL, Yen MF, Yu LC. Grid-based crime prediction using geographical features. ISPRS Int J Geo-Information. 2018;7(8):298.10.3390/ijgi7080298Search in Google Scholar

[47] Stec A, Klabjan D. Forecasting crime with deep learning. arXiv preprint arXiv; 2018. p. 01486.Search in Google Scholar

[48] Kim KS, Jeong YH. A study on crime prediction to reduce crime rate based on artificial intelligence. Korea J Artif Intell. 2021;9(1):15–20.Search in Google Scholar

[49] Bogomolov A, Lepri B, Staiano J, Oliver N, Pianesi F, Pentland A. Once upon a crime: towards crime prediction from demographics and mobile data. In Proceedings of the 16th International Conference on Multimodal Interaction; 2014.10.1145/2663204.2663254Search in Google Scholar

[50] Zhuang Y, Almeida M, Morabito M Ding W. Crime hot spot forecasting: A recurrent model with spatial and temporal information. In 2017 IEEE International Conference on Big Knowledge (ICBK). IEEE; 2017.10.1109/ICBK.2017.3Search in Google Scholar

[51] Ivan N, Ahishakiye E, Omulo EO, Taremwa D. Crime prediction using decision tree (J48) classification algorithm. International Journal of Computer and Information Technology. 2017;6:188–95.Search in Google Scholar

[52] El Bour HA, Ounacer S, Elghomari Y, Jihal H, Azzouazi M. A crime prediction model based on spatial and temporal data. Periodicals Eng Nat Sci. 2018;6(2):360–4.10.21533/pen.v6i2.524Search in Google Scholar

[53] Kim S, Joshi P, Kalsi PS, Taheri P. Crime analysis through machine learning. In IEEE 9th Annual Information Technology, Electronics and Mobile Communication Conference (IEMCON). IEEE; 2018.10.1109/IEMCON.2018.8614828Search in Google Scholar

[54] Bharati A, RA KS. Crime prediction and analysis using machine learning. Int Res J Eng Technol (IRJET). 2018;5:1037–42.Search in Google Scholar

[55] Mahmud S, Nuha M, Sattar A. Crime Rate Prediction Using Machine Learning and Data Mining, in Soft Computing Techniques and Applications. Singapore: Springer; 2021. p. 59–69.10.1007/978-981-15-7394-1_5Search in Google Scholar

[56] Almuhanna AA, Alrehili MM, Alsubhi SH, Syed L. Prediction of crime in neighbourhoods of New York City using spatial data analysis. In 2021 1st International Conference on Artificial Intelligence and Data Analytics (CAIDA). IEEE; 2021.10.1109/CAIDA51941.2021.9425120Search in Google Scholar

[57] Yu CH, Ward MW, Morabito M, Ding W. Crime forecasting using data mining techniques. In 2011 IEEE 11th International Conference on Data Mining Workshops. IEEE; 2011.10.1109/ICDMW.2011.56Search in Google Scholar

[58] Sharma H, Kumar S. A survey on decision tree algorithms of classification in data mining. Int J Sci Res. 2016;5(4):2094–7.10.21275/v5i4.NOV162954Search in Google Scholar

[59] Gupta A, Mohammad A, Syed A, Halgamuge MN. A comparative study of classification algorithms using data mining: crime and accidents in Denver City the USA. Education. 2016;7(7):374–81.10.14569/IJACSA.2016.070753Search in Google Scholar

[60] Boppuru PR, Ramesha K. Spatio-temporal crime analysis using KDE and ARIMA models in the indian context. Int J Digital Crime Forensics. 2020;12(4):1–19.10.4018/IJDCF.2020100101Search in Google Scholar

[61] Tayal D, Jain A, Arora S, Agarwal S, Gupta T, Tyagi N. Crime detection and criminal identification in India using data mining techniques. AI Soc. 2015;30(1):117–27.10.1007/s00146-014-0539-6Search in Google Scholar

[62] Iqbal R, Murad MAA, Mustapha A, Panahy PHS, Khanahmadliravi N. An experimental study of classification algorithms for crime prediction. Indian J Sci Technol. 2013;6(3):4219–25.10.17485/ijst/2013/v6i3.6Search in Google Scholar

[63] Almanie T, Mirza R, Lor E. Crime prediction based on crime types and using spatial and temporal criminal hotspots. arXiv preprint arXiv; 2015. p. 02050.10.5121/ijdkp.2015.5401Search in Google Scholar

[64] Yerpude P, Gudur V. Predictive modelling of crime dataset using data mining. Int J Data Min Knowl Manag Process. 2020;7:83–99.Search in Google Scholar

[65] Prathap BR, Krishna A, Balachandran K. Crime analysis and forecasting on spatio temporal news feed data—An indian context, in artificial intelligence and blockchain for future cybersecurity applications. Switzerland: Springer; 2021. p. 307–27.10.1007/978-3-030-74575-2_16Search in Google Scholar

Received: 2022-08-07
Revised: 2022-11-01
Accepted: 2022-11-09
Published Online: 2023-03-31

© 2023 the author(s), published by De Gruyter

This work is licensed under the Creative Commons Attribution 4.0 International License.

Articles in the same Issue

  1. Research Articles
  2. Salp swarm and gray wolf optimizer for improving the efficiency of power supply network in radial distribution systems
  3. Deep learning in distributed denial-of-service attacks detection method for Internet of Things networks
  4. On numerical characterizations of the topological reduction of incomplete information systems based on evidence theory
  5. A novel deep learning-based brain tumor detection using the Bagging ensemble with K-nearest neighbor
  6. Detecting biased user-product ratings for online products using opinion mining
  7. Evaluation and analysis of teaching quality of university teachers using machine learning algorithms
  8. Efficient mutual authentication using Kerberos for resource constraint smart meter in advanced metering infrastructure
  9. Recognition of English speech – using a deep learning algorithm
  10. A new method for writer identification based on historical documents
  11. Intelligent gloves: An IT intervention for deaf-mute people
  12. Reinforcement learning with Gaussian process regression using variational free energy
  13. Anti-leakage method of network sensitive information data based on homomorphic encryption
  14. An intelligent algorithm for fast machine translation of long English sentences
  15. A lattice-transformer-graph deep learning model for Chinese named entity recognition
  16. Robot indoor navigation point cloud map generation algorithm based on visual sensing
  17. Towards a better similarity algorithm for host-based intrusion detection system
  18. A multiorder feature tracking and explanation strategy for explainable deep learning
  19. Application study of ant colony algorithm for network data transmission path scheduling optimization
  20. Data analysis with performance and privacy enhanced classification
  21. Motion vector steganography algorithm of sports training video integrating with artificial bee colony algorithm and human-centered AI for web applications
  22. Multi-sensor remote sensing image alignment based on fast algorithms
  23. Replay attack detection based on deformable convolutional neural network and temporal-frequency attention model
  24. Validation of machine learning ridge regression models using Monte Carlo, bootstrap, and variations in cross-validation
  25. Computer technology of multisensor data fusion based on FWA–BP network
  26. Application of adaptive improved DE algorithm based on multi-angle search rotation crossover strategy in multi-circuit testing optimization
  27. HWCD: A hybrid approach for image compression using wavelet, encryption using confusion, and decryption using diffusion scheme
  28. Environmental landscape design and planning system based on computer vision and deep learning
  29. Wireless sensor node localization algorithm combined with PSO-DFP
  30. Development of a digital employee rating evaluation system (DERES) based on machine learning algorithms and 360-degree method
  31. A BiLSTM-attention-based point-of-interest recommendation algorithm
  32. Development and research of deep neural network fusion computer vision technology
  33. Face recognition of remote monitoring under the Ipv6 protocol technology of Internet of Things architecture
  34. Research on the center extraction algorithm of structured light fringe based on an improved gray gravity center method
  35. Anomaly detection for maritime navigation based on probability density function of error of reconstruction
  36. A novel hybrid CNN-LSTM approach for assessing StackOverflow post quality
  37. Integrating k-means clustering algorithm for the symbiotic relationship of aesthetic community spatial science
  38. Improved kernel density peaks clustering for plant image segmentation applications
  39. Biomedical event extraction using pre-trained SciBERT
  40. Sentiment analysis method of consumer comment text based on BERT and hierarchical attention in e-commerce big data environment
  41. An intelligent decision methodology for triangular Pythagorean fuzzy MADM and applications to college English teaching quality evaluation
  42. Ensemble of explainable artificial intelligence predictions through discriminate regions: A model to identify COVID-19 from chest X-ray images
  43. Image feature extraction algorithm based on visual information
  44. Optimizing genetic prediction: Define-by-run DL approach in DNA sequencing
  45. Study on recognition and classification of English accents using deep learning algorithms
  46. Review Articles
  47. Dimensions of artificial intelligence techniques, blockchain, and cyber security in the Internet of medical things: Opportunities, challenges, and future directions
  48. A systematic literature review of undiscovered vulnerabilities and tools in smart contract technology
  49. Special Issue: Trustworthy Artificial Intelligence for Big Data-Driven Research Applications based on Internet of Everythings
  50. Deep learning for content-based image retrieval in FHE algorithms
  51. Improving binary crow search algorithm for feature selection
  52. Enhancement of K-means clustering in big data based on equilibrium optimizer algorithm
  53. A study on predicting crime rates through machine learning and data mining using text
  54. Deep learning models for multilabel ECG abnormalities classification: A comparative study using TPE optimization
  55. Predicting medicine demand using deep learning techniques: A review
  56. A novel distance vector hop localization method for wireless sensor networks
  57. Development of an intelligent controller for sports training system based on FPGA
  58. Analyzing SQL payloads using logistic regression in a big data environment
  59. Classifying cuneiform symbols using machine learning algorithms with unigram features on a balanced dataset
  60. Waste material classification using performance evaluation of deep learning models
  61. A deep neural network model for paternity testing based on 15-loci STR for Iraqi families
  62. AttentionPose: Attention-driven end-to-end model for precise 6D pose estimation
  63. The impact of innovation and digitalization on the quality of higher education: A study of selected universities in Uzbekistan
  64. A transfer learning approach for the classification of liver cancer
  65. Review of iris segmentation and recognition using deep learning to improve biometric application
  66. Special Issue: Intelligent Robotics for Smart Cities
  67. Accurate and real-time object detection in crowded indoor spaces based on the fusion of DBSCAN algorithm and improved YOLOv4-tiny network
  68. CMOR motion planning and accuracy control for heavy-duty robots
  69. Smart robots’ virus defense using data mining technology
  70. Broadcast speech recognition and control system based on Internet of Things sensors for smart cities
  71. Special Issue on International Conference on Computing Communication & Informatics 2022
  72. Intelligent control system for industrial robots based on multi-source data fusion
  73. Construction pit deformation measurement technology based on neural network algorithm
  74. Intelligent financial decision support system based on big data
  75. Design model-free adaptive PID controller based on lazy learning algorithm
  76. Intelligent medical IoT health monitoring system based on VR and wearable devices
  77. Feature extraction algorithm of anti-jamming cyclic frequency of electronic communication signal
  78. Intelligent auditing techniques for enterprise finance
  79. Improvement of predictive control algorithm based on fuzzy fractional order PID
  80. Multilevel thresholding image segmentation algorithm based on Mumford–Shah model
  81. Special Issue: Current IoT Trends, Issues, and Future Potential Using AI & Machine Learning Techniques
  82. Automatic adaptive weighted fusion of features-based approach for plant disease identification
  83. A multi-crop disease identification approach based on residual attention learning
  84. Aspect-based sentiment analysis on multi-domain reviews through word embedding
  85. RES-KELM fusion model based on non-iterative deterministic learning classifier for classification of Covid19 chest X-ray images
  86. A review of small object and movement detection based loss function and optimized technique
Downloaded on 16.12.2025 from https://www.degruyterbrill.com/document/doi/10.1515/jisys-2022-0223/html
Scroll to top button