Results for ' data clustering algorithm'

994 found
Order:
  1.  33
    Clustering Algorithms in Hybrid Recommender System on MovieLens Data.Urszula Kuzelewska - 2014 - Studies in Logic, Grammar and Rhetoric 37 (1):125-139.
    Decisions are taken by humans very often during professional as well as leisure activities. It is particularly evident during surfing the Internet: selecting web sites to explore, choosing needed information in search engine results or deciding which product to buy in an on-line store. Recommender systems are electronic applications, the aim of which is to support humans in this decision making process. They are widely used in many applications: adaptive WWW servers, e-learning, music and video preferences, internet stores etc. In (...)
    No categories
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark   1 citation  
  2.  20
    Neutrosophic Clustering Algorithm Based on Sparse Regular Term Constraint.Dan Zhang, Yingcang Ma, Hu Zhao & Xiaofei Yang - 2021 - Complexity 2021:1-12.
    Clustering algorithm is one of the important research topics in the field of machine learning. Neutrosophic clustering is the generalization of fuzzy clustering and has been applied to many fields. This paper presents a new neutrosophic clustering algorithm with the help of regularization. Firstly, the regularization term is introduced into the FC-PFS algorithm to generate sparsity, which can reduce the complexity of the algorithm on large data sets. Secondly, we propose a (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  3.  48
    Construction of Student Information Management System Based on Data Mining and Clustering Algorithm.XueHong Yin - 2021 - Complexity 2021:1-11.
    Data mining is a new technology developed in recent years. Through data mining, people can discover the valuable and potential knowledge hidden behind the data and provide strong support for scientifically making various business decisions. This paper applies data mining technology to the college student information management system, mines student evaluation information data, uses data mining technology to design student evaluation information modules, and digs out the factors that affect student development and the various (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  4.  20
    Adaptive Mixed-Attribute Data Clustering Method Based on Density Peaks.Shihua Liu - 2022 - Complexity 2022:1-13.
    The clustering of mixed-attribute data is a vital and challenging issue. The density peaks clustering algorithm brings us a simple and efficient solution, but it mainly focuses on numerical attribute data clustering and cannot be adaptive. In this paper, we studied the adaptive improvement method of such an algorithm and proposed an adaptive mixed-attribute data clustering method based on density peaks called AMDPC. In this algorithm, we used the unified distance (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  5.  8
    An FCM clustering algorithm based on the identification of accounting statement whitewashing behavior in universities.Qihao Yang - 2022 - Journal of Intelligent Systems 31 (1):345-355.
    The traditional recognition method of whitewash behavior of accounting statements needs to analyze a large number of special data samples. The learning rate of the algorithm is low, resulting in low recognition accuracy. To solve the aforementioned problems, this article proposes a method to identify the whitewash behavior of university accounting statements based on the FCM clustering algorithm. This article analyzes the motivation of university accounting statement whitewashing behavior, studies the common means of statement whitewashing, and (...)
    No categories
    Direct download  
     
    Export citation  
     
    Bookmark  
  6.  15
    Results of testing, research and analysis of the basic clustering algorithms of numerical data sets.Trokhymchuk R. M. - 2019 - Artificial Intelligence Scientific Journal 24 (1-2):101-107.
    This work is devoted to the testing, research and comparative analysis of the most well-known and widely used methods and algorithms for clustering numerical data sets. Multidimensional scaling was applied to evaluate the results of solving the clustering problem by visualizing datasets at all stages of the implementation of the studied algorithms. All algorithms were tested for artificial and real data sets. As a result, for each of the investigated algorithms, the main characteristics were formulated in (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  7.  17
    Application of clustering algorithm in complex landscape farmland synthetic aperture radar image segmentation.Mohammad Shabaz, Korhan Cengiz, Zhenxing Hua, Biao Cong & Zhuoran Chen - 2021 - Journal of Intelligent Systems 30 (1):1014-1025.
    In synthetic aperture radar image segmentation field, regional algorithms have shown great potential for image segmentation. The SAR images have a multiplicity of complex texture, which are difficult to be divided as a whole. Existing algorithm may cause mixed super-pixels with different labels due to speckle noise. This study presents the technique based on organization evolution algorithm to improve ISODATA in pixels. This approach effectively filters out the useless local information and successfully introduces the effective information. To verify (...)
    No categories
    Direct download  
     
    Export citation  
     
    Bookmark  
  8.  24
    Precision Marketing Method of E-Commerce Platform Based on Clustering Algorithm.Bei Zhang, Luquan Wang & Yuanyuan Li - 2021 - Complexity 2021:1-10.
    In user cluster analysis, users with the same or similar behavior characteristics are divided into the same group by iterative update clustering, and the core and larger user groups are detected. In this paper, we present the formulation and data mining of the correlation rules based on the clustering algorithm through the definition and procedure of the algorithm. In addition, based on the idea of the K-mode clustering algorithm, this paper proposes a (...) method combining related rules with multivalued discrete features. In this paper, we construct a method to calculate the similarity between users using Jaccard distance and combine correlation rules with Jaccard distances to improve the similarity between users. Next, we propose a clustering method suitable for MDF. Finally, the basic K-mode algorithm is improved by the similarity measure method combining the correlation rule with the Jaccard distance and the cluster center update method which is the ARMDKM algorithm proposed in this paper. This method solves the problem that the MDF cannot be effectively processed in the traditional model and demonstrates its theoretical correctness. This experiment verifies the correctness of the new method by clustering purity, entropy, contour, and other indicators. (shrink)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  9.  20
    Interactive Multimodal Television Media Adaptive Visual Communication Based on Clustering Algorithm.Huayuan Yang & Xin Zhang - 2020 - Complexity 2020:1-9.
    This article starts with the environmental changes in human cognition, analyzes the virtual as the main feature of visual perception under digital technology, and explores the transition from passive to active human cognitive activities. With the diversified understanding of visual information, human contradiction of memory also began to become prominent. Aiming at the problem that the existing multimodal TV media recognition methods have low recognition rate of unknown application layer protocols, an adaptive clustering method for identifying unknown application layer (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark   1 citation  
  10.  8
    Early Warning of Financial Risk Based on K-Means Clustering Algorithm.Zhangyao Zhu & Na Liu - 2021 - Complexity 2021:1-12.
    The early warning of financial risk is to identify and analyze existing financial risk factors, determine the possibility and severity of occurring risks, and provide scientific basis for risk prevention and management. The fragility of financial system and the destructiveness of financial crisis make it extremely important to build a good financial risk early-warning mechanism. The main idea of the K-means clustering algorithm is to gradually optimize clustering results and constantly redistribute target dataset to each clustering (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark   1 citation  
  11.  18
    A Data-Driven Parameter Adaptive Clustering Algorithm Based on Density Peak.Tao Du, Shouning Qu & Qin Wang - 2018 - Complexity 2018:1-14.
    No categories
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark   1 citation  
  12.  16
    Data Analysis of College Students’ Mental Health Based on Clustering Analysis Algorithm.Yichen Chu & Xiaojian Yin - 2021 - Complexity 2021:1-10.
    Mental health is an important basic condition for college students to become adults. Educators gradually attach importance to strengthening the mental health education of college students. This paper makes a detailed analysis and research on college students’ mental health, expounds the development and application of clustering analysis algorithm, applies the distance formula and clustering criterion function commonly used in clustering analysis, and makes a specific description of some classic algorithms of clustering analysis. Based on expounding (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark   1 citation  
  13.  13
    Design of digital economy consumer psychology prediction model based on canopy clustering algorithm.Yue Zhang, Peng Ruan & Jingfeng Zhao - 2022 - Frontiers in Psychology 13.
    With the continuous improvement of the level of science and technology, the popularization of the Internet and the development of applications, online consumption has become a major force in personal consumption. As a result, digital consumption is born, and digital consumption is not only reflected in transaction consumption at the monetary level. Like some intangible services similar to the use of dating software, it can also become digital consumption. In this environment, a new economic concept, the digital economy, has emerged (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  14.  14
    Audit Analysis of Abnormal Behavior of Social Security Fund Based on Adaptive Spectral Clustering Algorithm.Yan Wu, Yonghong Chen & Wenhao Ling - 2021 - Complexity 2021:1-11.
    Abnormal behavior detection of social security funds is a method to analyze large-scale data and find abnormal behavior. Although many methods based on spectral clustering have achieved many good results in the practical application of clustering, the research on the spectral clustering algorithm is still in the early stage of development. Many existing algorithms are very sensitive to clustering parameters, especially scale parameters, and need to manually input the number of clustering. Therefore, a (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  15.  52
    Polarisation assessment in an intelligent argumentation system using fuzzy clustering algorithm for collaborative decision support.Ravi Santosh Arvapally & Xiaoqing Liu - 2013 - Argument and Computation 4 (3):181-208.
    We developed an on-line intelligent argumentation system which facilitates stakeholders in exchanging dialogues. It provides decision support by capturing stakeholders’ rationale through arguments. As part of the argumentation process, stakeholders tend to both polarise their opinions and form polarisation groups. The challenging issue of assessing argumentation polarisation had not been addressed in argumentation systems until recently. Arvapally, Liu, and Jiang [, ‘Identification of Faction Groups and Leaders in Web-Based Intelligent Argumentation System for Collaborative Decision Support’, in Proceedings of International Conference (...)
    Direct download (4 more)  
     
    Export citation  
     
    Bookmark   1 citation  
  16.  11
    Behavior feature extraction method of college students’ social network in sports field based on clustering algorithm.Haiou Sun & Yonggang Wang - 2022 - Journal of Intelligent Systems 31 (1):477-488.
    In order to improve the integrity of the social network behavior feature extraction results for sports college students, this study proposes to be based on the clustering algorithm. This study analyzes the social network information dissemination mechanism in the field of college students’ sports, obtains the real-time social behavior data in the network environment combined with the analysis results, and processes the obtained social network behavior data from two aspects of data cleaning and de-duplication. Using (...)
    Direct download  
     
    Export citation  
     
    Bookmark  
  17.  44
    Passenger flow forecast for customized bus based on time series fuzzy clustering algorithm.Ming Li, Linlin Wang, Jingfeng Yang, Zhenkun Zhang, Nanfeng Zhang, Yifei Xiang & Handong Zhou - 2019 - Interaction Studies 20 (1):42-60.
    Customized bus services are conducive to improving urban traffic and environment, and have attracted widespread attention. However, the problems encountered in the new customized bus mode include the large difference between the basis of customized bus passenger flow data analysis and the basis of the traditional bus passenger flow data analysis, and the difficulty in different vehicle scheduling caused by the combination of traditional and customized bus modes. We propose a customized bus passenger flow analysis algorithm and (...)
    No categories
    Direct download (3 more)  
     
    Export citation  
     
    Bookmark  
  18.  26
    Enhancement of K-means clustering in big data based on equilibrium optimizer algorithm.Omar Saber Qasim, Zakariya Yahya Algamal & Sarah Ghanim Mahmood Al-Kababchee - 2023 - Journal of Intelligent Systems 32 (1).
    Data mining’s primary clustering method has several uses, including gene analysis. A set of unlabeled data is divided into clusters using data features in a clustering study, which is an unsupervised learning problem. Data in a cluster are more comparable to one another than to those in other groups. However, the number of clusters has a direct impact on how well the K-means algorithm performs. In order to find the best solutions for these (...)
    Direct download  
     
    Export citation  
     
    Bookmark  
  19.  14
    Using Big Data Fuzzy K-Means Clustering and Information Fusion Algorithm in English Teaching Ability Evaluation.Chen Zhen - 2021 - Complexity 2021:1-9.
    Aiming at the problem of inaccurate classification of big data information in traditional English teaching ability evaluation algorithms, an English teaching ability evaluation algorithm based on big data fuzzy K-means clustering and information fusion is proposed. Firstly, the author uses the idea of K-means clustering to analyze the collected original error data, such as teacher level, teaching facility investment, and policy relevance level, removes the data that the algorithm considers unreliable, uses the (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  20.  13
    Finding Clusters and Outliers for Data Sets with Constraints.Yong Shi - 2011 - Journal of Intelligent Systems 20 (1):3-14.
    In this paper, we present our research on data mining approaches with the existence of obstacles. Although there are a lot of algorithms designed to detect clusters with obstacles, few algorithms can detect clusters and outliers simultaneously and interactively. We here extend our original research [Shi, Zhang, Towards Exploring Interactive Relationship between Clusters and Outliers in Multi-Dimensional Data Analysis, 518–519: IEEE Computer Society, 2005] on iterative cluster and outlier detection to study the problem of detecting cluster and outliers (...)
    No categories
    Direct download  
     
    Export citation  
     
    Bookmark  
  21.  21
    From instance-level constraints to space-level constraints: Making the most of prior knowledge in data clustering.Dan Klein & Christopher D. Manning - unknown
    We present an improved method for clustering in the presence of very limited supervisory information, given as pairwise instance constraints. By allowing instance-level constraints to have spacelevel inductive implications, we are able to successfully incorporate constraints for a wide range of data set types. Our method greatly improves on the previously studied constrained -means algorithm, generally requiring less than half as many constraints to achieve a given accuracy on a range of real-world data, while also being (...)
    Direct download  
     
    Export citation  
     
    Bookmark   1 citation  
  22.  22
    The potential of eye tracking data to strengthen CDA’ explanatory power: the case of multimodal critical discourse analysis of advertising persuasion.Yixiong Chen & Csilla Weninger - forthcoming - Critical Discourse Studies.
    Multimodal Critical Discourse Analysis (MCDA) as a sub-discipline of Critical Discourse Analysis (CDA) emerged from the availability of social semiotic frameworks describing multimodal meaning making. However, weaknesses of these frameworks have raised concerns and prompted recent methodological reflections in MCDA. Inspired by these reflections, this paper critically assesses MCDA research on advertising persuasion and identifies a lack of attention in studies to account for the social and ideological impact of advertising. This shortcoming is argued to be attributable to the weak (...)
    No categories
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  23.  21
    An Improved Integrated Clustering Learning Strategy Based on Three-Stage Affinity Propagation Algorithm with Density Peak Optimization Theory.Limin Wang, Wenjing Sun, Xuming Han, Zhiyuan Hao, Ruihong Zhou, Jinglin Yu & Milan Parmar - 2021 - Complexity 2021:1-12.
    To better reflect the precise clustering results of the data samples with different shapes and densities for affinity propagation clustering algorithm, an improved integrated clustering learning strategy based on three-stage affinity propagation algorithm with density peak optimization theory was proposed in this paper. DPKT-AP combined the ideology of integrated clustering with the AP algorithm, by introducing the density peak theory and k-means algorithm to carry on the three-stage clustering process. In (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  24.  34
    Hybrid Efficient Genetic Algorithm for Big Data Feature Selection Problems.Tareq Abed Mohammed, Oguz Bayat, Osman N. Uçan & Shaymaa Alhayali - 2020 - Foundations of Science 25 (4):1009-1025.
    Due to the huge amount of data being generating from different sources, the analyzing and extracting of useful information from these data becomes a very complex task. The difficulty of dealing with big data optimization problems comes from many factors such as the high number of features, and the existing of lost data. The feature selection process becomes an important step in many data mining and machine learning algorithms to reduce the dimensionality of the optimization (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  25.  17
    On phantom publics, clusters, and collectives: be(com)ing subject in algorithmic times.Marie Petersmann & Dimitri Van Den Meerssche - forthcoming - AI and Society:1-18.
    This article starts from the observation that practices of ‘algorithmic governmentality’ or ‘governance by data’ are reconfiguring modes of social relationality and collectivity. By building, first, on an empirical exploration of digital bordering practices, we qualify these emergent algorithmic categories as ‘clusters’—pulsing patterns distilled from disaggregated data. As fluid, modular, and ever-emergent forms of association, these ‘clusters’ defy stable expressions of collective representation and social recognition. Second, we observe that this empirical analysis resonates with accounts that diagnosed algorithmic (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark   2 citations  
  26.  13
    An Optimized K-Harmonic Means Algorithm Combined with Modified Particle Swarm Optimization and Cuckoo Search Algorithm.Nacer Farajzadeh & Asgarali Bouyer - 2019 - Journal of Intelligent Systems 29 (1):1-18.
    Among the data clustering algorithms, the k-means (KM) algorithm is one of the most popular clustering techniques because of its simplicity and efficiency. However, KM is sensitive to initial centers and it has a local optima problem. The k-harmonic means (KHM) clustering algorithm solves the initialization problem of the KM algorithm, but it also has a local optima problem. In this paper, we develop a new algorithm for solving this problem based on (...)
    No categories
    Direct download  
     
    Export citation  
     
    Bookmark  
  27.  27
    Event Mining Through Clustering.T. V. Geetha & E. Umamaheswari - 2014 - Journal of Intelligent Systems 23 (1):59-73.
    Traditional document clustering algorithms consider text-based features such as unique word count, concept count, etc. to cluster documents. Meanwhile, event mining is the extraction of specific events, their related sub-events, and the associated semantic relations from documents. This work discusses an approach to event mining through clustering. The Universal Networking Language -based subgraph, a semantic representation of the document, is used as the input for clustering. Our research focuses on exploring the use of three different feature sets (...)
    Direct download  
     
    Export citation  
     
    Bookmark  
  28.  29
    Clustering Methods Using Distance-Based Similarity Measures of Single-Valued Neutrosophic Sets.Jun Ye - 2014 - Journal of Intelligent Systems 23 (4):379-389.
    Clustering plays an important role in data mining, pattern recognition, and machine learning. Single-valued neutrosophic sets are useful means to describe and handle indeterminate and inconsistent information that fuzzy sets and intuitionistic fuzzy sets cannot describe and deal with. To cluster the data represented by single-valued neutrosophic information, this article proposes single-valued neutrosophic clustering methods based on similarity measures between SVNSs. First, we define a generalized distance measure between SVNSs and propose two distance-based similarity measures of (...)
    No categories
    Direct download  
     
    Export citation  
     
    Bookmark   11 citations  
  29. A Hybrid Fuzzy Wavelet Neural Network Model with Self-Adapted Fuzzy c-Means Clustering and Genetic Algorithm for Water Quality Prediction in Rivers.Mingzhi Huang, Hongbin di TianLiu, Chao Zhang, Xiaohui Yi, Jiannan Cai, Jujun Ruan, Tao Zhang, Shaofei Kong & Guangguo Ying - 2018 - Complexity 2018:1-11.
    Water quality prediction is the basis of water environmental planning, evaluation, and management. In this work, a novel intelligent prediction model based on the fuzzy wavelet neural network including the neural network, the fuzzy logic, the wavelet transform, and the genetic algorithm was proposed to simulate the nonlinearity of water quality parameters and water quality predictions. A self-adapted fuzzy c-means clustering was used to determine the number of fuzzy rules. A hybrid learning algorithm based on a genetic (...)
    No categories
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark   2 citations  
  30.  14
    A hybrid fuzzy clustering approach for diagnosing primary headache disorder.Svetlana Simić, Zorana Banković, José R. Villar, Dragan Simić & Svetislav D. Simić - 2021 - Logic Journal of the IGPL 29 (2):220-235.
    Clustering is one of the most fundamental and essential data analysis tasks with broad applications. It has been studied in various research fields: data mining, machine learning, pattern recognition and in engineering, economics and biomedical data analysis. Headache is not a disease that typically shortens one’s life, but it can be a serious social as well as a health problem. Approximately 27 billion euros per year are lost through reduced work productivity in the European community. This (...)
    Direct download (3 more)  
     
    Export citation  
     
    Bookmark  
  31.  23
    Improved FCM Algorithm Based on K-Means and Granular Computing.Zhuang Zhi Yan & Wei Jia Lu - 2015 - Journal of Intelligent Systems 24 (2):215-222.
    The fuzzy clustering algorithm has been widely used in the research area and production and life. However, the conventional fuzzy algorithms have a disadvantage of high computational complexity. This article proposes an improved fuzzy C-means algorithm based on K-means and principle of granularity. This algorithm is aiming at solving the problems of optimal number of clusters and sensitivity to the data initialization in the conventional FCM methods. The initialization stage of the K-medoid cluster, which is (...)
    No categories
    Direct download  
     
    Export citation  
     
    Bookmark  
  32.  33
    Every word you say: algorithmic mediation and implications of data-driven scholarly communication.Luciana Monteiro-Krebs, Bieke Zaman, David Geerts & Sônia Elisa Caregnato - 2023 - AI and Society 38 (2):1003-1012.
    Implications of algorithmic mediation can be studied through the artefact itself, peoples’ practices, and the social/political/economical arrangements that affect and are affected by such interactions. Most studies in Academic social media (ASM) focus on one of these elements at a time, either examining design elements or the users’ behaviour on and perceptions of such platforms. We take a multi-faceted approach using affordances as a lens to analyze practices and arrangements traversed by algorithmic mediation. Following our earlier studies that examined the (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  33.  29
    Conceptual mapping through keyword coupled clustering.Zvika Marx & Ido Dagan - 2001 - Mind and Society 2 (2):59-85.
    This paper introduces coupled clustering—a novel computational framework for detecting corresponding themes in unstructured data. Gaining its inspiration from the structure mapping theory, our framework utilizes unsupervised statistical learning tools for automatic construction of aligned representations reflecting the context of the particular mapping being made. The coupled clustering algorithm is demonstrated and evaluated through detecting conceptual correspondences in textual corpora. In its current phase, the method is primarily oriented towards context-dependent feature-based similarity. However, it is preliminary (...)
    Direct download (4 more)  
     
    Export citation  
     
    Bookmark   1 citation  
  34. Real-World Applications of Evolutionary Computation Techniques-Clustering Protein Interaction Data Through Chaotic Genetic Algorithm.Hongbiao Liu & Juan Liu - 2006 - In O. Stock & M. Schaerf (eds.), Lecture Notes In Computer Science. Springer Verlag. pp. 4247--858.
    No categories
     
    Export citation  
     
    Bookmark  
  35. Evolutionary Discovery of Fuzzy Concepts in Data.Lewis L. H. Chung & Keith C. C. Chan - 2003 - Brain and Mind 4 (2):253-268.
    Given a set of objects characterized by a number of attributes, hidden patterns can be discovered in them for the grouping of similar objects into clusters. If each of these clusters can be considered as exemplifying a certain concept, then the problem concerned can be referred to as a concept discovery problem. This concept discovery problem can be solved to some extent by existing data clustering techniques. However, they may not be applicable when the concept involved is vague (...)
    Direct download (7 more)  
     
    Export citation  
     
    Bookmark  
  36.  71
    Clustering the Tagged Web.Christopher D. Manning - unknown
    Automatically clustering web pages into semantic groups promises improved search and browsing on the web. In this paper, we demonstrate how user-generated tags from largescale social bookmarking websites such as del.icio.us can be used as a complementary data source to page text and anchor text for improving automatic clustering of web pages. This paper explores the use of tags in 1) K-means clustering in an extended vector space model that includes tags as well as page text (...)
    Direct download  
     
    Export citation  
     
    Bookmark   1 citation  
  37.  18
    Using Clustering Analysis and Association Rule Technology in Cross-Marketing.Yang Cheng, Ming Cheng, Tao Pang & Sizhen Liu - 2021 - Complexity 2021:1-11.
    In this paper, according to the perspective of customers and products, by using clustering analysis and association rule technology, this paper proposes a cross-marketing model based on an improved sequential pattern mining algorithm, where an improved algorithm AP is applied. The algorithm can reduce the time cost of constructing a projection database and the influence of the increase of support on the algorithm efficiency. The improved idea is that when the first partition is used to (...)
    No categories
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  38.  29
    Single-Valued Neutrosophic Minimum Spanning Tree and Its Clustering Method.Jun Ye - 2014 - Journal of Intelligent Systems 23 (3):311-324.
    Clustering plays an important role in data mining, pattern recognition, and machine learning. Then, single-valued neutrosophic sets are a useful means to describe and handle indeterminate and inconsistent information, which fuzzy sets and intuitionistic fuzzy sets cannot describe and deal with. To cluster the data represented by single-value neutrosophic information, the article proposes a single-valued neutrosophic minimum spanning tree clustering algorithm. Firstly, we defined a generalized distance measure between SVNSs. Then, we present an SVNMST (...) algorithm for clustering single-value neutrosophic data based on the generalized distance measure of SVNSs. Finally, two illustrative examples are given to demonstrate the application and effectiveness of the developed approach. (shrink)
    No categories
    Direct download  
     
    Export citation  
     
    Bookmark   13 citations  
  39.  8
    False Financial Statement Identification Based on Fuzzy C-Means Algorithm.Jixiao Li - 2021 - Complexity 2021:1-11.
    Financial accountants falsify financial statements by means of financial techniques such as financial practices and financial standards, and when compared with conventional financial data, it is found that the falsified financial data often lack correlation or even contradict each other in terms of financial data indicators. At the same time, there are also inherent differences in reporting patterns from conventional financial data, but these differences are difficult to test manually. In this paper, the fuzzy C-means (...) method is used to amplify these differences and thus identify false financial statements. In the proposed algorithm, firstly, the normalization constraint of the FCM clustering algorithm on the sum of affiliation of individual samples is relaxed to the constraint on the sum of affiliation of all samples, thus reducing the sensitivity of the algorithm to noise and isolated points; secondly, a new affiliation correction method is proposed to address the problem that the difference in affiliation is too large after the relaxation of the constraint. In the discussion of this paper, most of the information comes from the annual reports of companies, administrative penalty decisions of the Securities Regulatory Commission, and some information comes from research reports made by securities institutions, which are limited sources of information. The proposed method can correct the affiliation to a reasonable range, effectively avoiding the problem that some samples have too much affiliation and become a class of their own and also avoiding the problem that it is difficult to choose the termination threshold of the algorithm iteration due to too little affiliation, and can ensure that the constraint on the sum of affiliation of all samples is always satisfied during the iteration of the algorithm. The method has the characteristics of high recognition accuracy and has the significance of theoretical method innovation. (shrink)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  40.  20
    Efficient Time Series Clustering and Its Application to Social Network Mining.Qianchuan Zhao & Cangqi Zhou - 2014 - Journal of Intelligent Systems 23 (2):213-229.
    Mining time series data is of great significance in various areas. To efficiently find representative patterns in these data, this article focuses on the definition of a valid dissimilarity measure and the acceleration of partitioning clustering, a common group of techniques used to discover typical shapes of time series. Dissimilarity measure is a crucial component in clustering. It is required, by some particular applications, to be invariant to specific transformations. The rationale for using the angle between (...)
    No categories
    Direct download  
     
    Export citation  
     
    Bookmark  
  41.  13
    Design of intelligent acquisition system for moving object trajectory data under cloud computing.Ioan-Cosmin Mihai, Shaweta Khanna, Sudeep Asthana, Abhinav Asthana & Yang Zhang - 2021 - Journal of Intelligent Systems 30 (1):763-773.
    In order to study the intelligent collection system of moving object trajectory data under cloud computing, information useful to passengers and taxi drivers is collected from massive trajectory data. This paper uses cloud computing technology, through clustering algorithm and density-based DBSCAN algorithm combined with Map Reduce programming model and design trajectory clustering algorithm. The results show that based on the 8-day data of 15,000 taxis in Shenzhen, the characteristic time period is determined. (...)
    No categories
    Direct download  
     
    Export citation  
     
    Bookmark  
  42.  16
    Algorithms and dehumanization: a definition and avoidance model.Mario D. Schultz, Melanie Clegg, Reto Hofstetter & Peter Seele - forthcoming - AI and Society:1-21.
    Dehumanization by algorithms raises important issues for business and society. Yet, these issues remain poorly understood due to the fragmented nature of the evolving dehumanization literature across disciplines, originating from colonialism, industrialization, post-colonialism studies, contemporary ethics, and technology studies. This article systematically reviews the literature on algorithms and dehumanization (n = 180 articles) and maps existing knowledge across several clusters that reveal its underlying characteristics. Based on the review, we find that algorithmic dehumanization is particularly problematic for human resource management (...)
    Direct download (3 more)  
     
    Export citation  
     
    Bookmark  
  43.  14
    Data Analysis Method of Intelligent Analysis Platform for Big Data of Film and Television.Youwen Ma & Yi Wan - 2021 - Complexity 2021:1-10.
    Based on cloud computing and statistics theory, this paper proposes a reasonable analysis method for big data of film and television. The method selects Hadoop open source cloud platform as the basis, combines the MapReduce distributed programming model and HDFS distributed file storage system and other key cloud computing technologies. In order to cope with different data processing needs of film and television industry, association analysis, cluster analysis, factor analysis, and K-mean + association analysis algorithm training model (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  44.  12
    A Gaussian Process Latent Variable Model for Subspace Clustering.Shangfang Li - 2021 - Complexity 2021:1-7.
    Effective feature representation is the key to success of machine learning applications. Recently, many feature learning models have been proposed. Among these models, the Gaussian process latent variable model for nonlinear feature learning has received much attention because of its superior performance. However, most of the existing GPLVMs are mainly designed for classification and regression tasks, thus cannot be used in data clustering task. To address this issue and extend the application scope, this paper proposes a novel GPLVM (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  45. Occam's Razor For Big Data?Birgitta Dresp-Langley - 2019 - Applied Sciences 3065 (9):1-28.
    Detecting quality in large unstructured datasets requires capacities far beyond the limits of human perception and communicability and, as a result, there is an emerging trend towards increasingly complex analytic solutions in data science to cope with this problem. This new trend towards analytic complexity represents a severe challenge for the principle of parsimony (Occam’s razor) in science. This review article combines insight from various domains such as physics, computational science, data engineering, and cognitive science to review the (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  46.  35
    Research on Hybrid Collaborative Filtering Recommendation Algorithm Based on the Time Effect and Sentiment Analysis.Xibin Wang, Zhenyu Dai, Hui Li & Jianfeng Yang - 2021 - Complexity 2021:1-11.
    In this study, we focus on the problem of information expiration when using the traditional collaborative filtering algorithm and propose a new collaborative filtering algorithm by integrating the time factor. This algorithm considers information influence attenuation over time, introduces an information retention period based on the information half-value period, and proposes a time-weighted function, which is applied to the nearest neighbor selection and score prediction to assign different time weights to the scores. In addition, to further improve (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark   2 citations  
  47.  24
    Social media and microtargeting: Political data processing and the consequences for Germany.Orestis Papakyriakopoulos, Simon Hegelich, Morteza Shahrezaye & Juan Carlos Medina Serrano - 2018 - Big Data and Society 5 (2).
    Amongst other methods, political campaigns employ microtargeting, a specific technique used to address the individual voter. In the US, microtargeting relies on a broad set of collected data about the individual. However, due to the unavailability of comparable data in Germany, the practice of microtargeting is far more challenging. Citizens in Germany widely treat social media platforms as a means for political debate. The digital traces they leave through their interactions provide a rich information pool, which can create (...)
    No categories
    Direct download  
     
    Export citation  
     
    Bookmark   1 citation  
  48.  10
    Optimization of shared bike paths considering faulty vehicle recovery during dispatch.Donghao Shi - 2022 - Journal of Intelligent Systems 31 (1):1024-1036.
    With the rapid development of China’s social economy and the improvement of the level of urbanization, urban transportation has also been greatly developed. With the booming development of the internet and the sharing economy industry, shared bicycles have emerged as the times requirement. Shared bicycles are a new type of urban transportation without piles. As a green way of travel, shared bicycles have the advantages of convenience, fashion, green, and environmental protection. However, many problems have also arisen in the use (...)
    No categories
    Direct download  
     
    Export citation  
     
    Bookmark  
  49.  11
    Music Personalized Label Clustering and Recommendation Visualization.Yongkang Huo - 2021 - Complexity 2021:1-8.
    With the advent of big data, the performance of traditional recommendation algorithms is no longer enough to meet the demand. Most people do not leave too many comments and other data when using the application. In this case, the user data are too scattered and discrete, with obvious data sparsity problems. First, this paper describes the main ideas and methods used in current recommendation systems and summarizes the areas that need attention and consideration. Based on these (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  50.  35
    Increasing Energy Efficiency in Wireless Sensor Networks Using GA-ANFIS to Choose a Cluster Head and Assess Routing and Weighted Trusts to Demodulate Attacker Nodes.Shaymaa Al Hayali, Javad Rahebi, Osman N. Ucan & Oguz Bayat - 2020 - Foundations of Science 25 (4):1227-1246.
    Demodulating harmful nodes and diminishing the energy waste in sensor nodes can prolong the lifespan of wireless sensor networks. In this study, a genetic algorithm and an adaptive neuro fuzzy inference system were used to diminish the energy waste of sensors. Weighted trust evaluation was applied to search for harmful nodes in the network to prolong the lifespan of WSNs. A low-energy adaptive clustering hierarchy method was used to analyze the results. It was discovered that searching for harmful (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
1 — 50 / 994