Web Search Result Clustering based on Heuristic Search and k-means

Published 2015 in arXiv.org

ABSTRACT

Giving user a simple and well organized web search result has been a topic of active information Retrieval (IR) research. Irrespective of how small or ambiguous a query is, a user always wants the desired result on the first display of an IR system. Clustering of an IR system result can render a way, which fulfills the actual information need of a user. In this paper, an approach to cluster an IR system result is presented.The approach is a combination of heuristics and k-means technique using cosine similarity. Our heuristic approach detects the initial value of k for creating initial centroids. This eliminates the problem of external specification of the value k, which may lead to unwanted result if wrongly specified. The centroids created in this way are more specific and meaningful in the context of web search result. Another advantage of the proposed method is the removal of the objective means function of k-means which makes cluster sizes same. The end result of the proposed approach consists of different clusters of documents having different sizes.

PUBLICATION RECORD

Publication year
2015
Venue
arXiv.org
Publication date
2015-08-11
Fields of study
Computer Science
Identifiers
arXiv 1508.02552
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

Labeling of Web Search Result Clusters Using Heuristic Search and Frequent Itemset
2015cited by this paper
Clustering results of image searches by annotations and visual features
2014cited by this paper
Incremental Beam search
2013cited by this paper
A new statistical strategy for pooling: ELI
2013cited by this paper
A Sampling-PSO-K-means Algorithm for Document Clustering
2013cited by this paper
Optimal clustering in the context of overlapping cluster analysis
2013cited by this paper
Fast global k-means clustering based on local geometrical information
2013cited by this paper
Black hole: A new heuristic optimization approach for data clustering
2013cited by this paper
Efficient stochastic algorithms for document clustering
2013cited by this paper
An Introduction to Information Retrieval
2013cited by this paper
Web Search Result Clustering using Heuristic Search and Latent Semantic Indexing
2012cited by this paper
A Review on Clustering of Web Search Result
2012cited by this paper
Understanding Search Engines
2012cited by this paper
Using semantic techniques to access web data
2011cited by this paper
A Novel Approach for Organizing Web Search Results using Ranking and Clustering
2010cited by this paper
Semantic Suffix Tree Clustering
2010cited by this paper
A survey of Web clustering engines
2009cited by this paper
A comparison of extrinsic clustering evaluation metrics based on formal constraints
2009influential reference
What Users See - Structures in Search Engine Results Pages
2009cited by this paper
Search Result Clustering via Randomized Partitioning of Query-Induced Subgraphs
2008cited by this paper
A new algorithm for clustering search results
2007cited by this paper
Enhancing K-Means Algorithm with Initial Cluster Centers Derived from Data Partitioning along the Data Axis with the Highest Variance
2007cited by this paper
Web Page Clustering Using Heuristic Search in the Web Graph
2007cited by this paper
Understanding Search Engines: Mathematical Modeling and Text Retrieval (Software, Environments, Tools), Second Edition
2005cited by this paper
Efficient online spherical k-means clustering
2005cited by this paper
A concept-driven algorithm for clustering search results
2005cited by this paper
Semantic, Hierarchical, Online Clustering of Web Search Results
2004cited by this paper
An Efficient k-Means Clustering Algorithm: Analysis and Implementation
2002cited by this paper
Efficient Clustering of Very Large Document Collections
2001cited by this paper
Spatial clustering methods in data mining : A survey
2001cited by this paper
Link Based Clustering of Web Search Results
2001cited by this paper
A Comparison of Document Clustering Techniques
2000cited by this paper
Understanding search engines: mathematical modeling and text retrieval (software
1999cited by this paper
Web document clustering: a feasibility demonstration
1998cited by this paper
Refining Initial Points for K-Means Clustering
1998cited by this paper
Scatter/Gather: a cluster-based approach to browsing large document collections
1992cited by this paper
Research and Development in Information Retrieval
1982cited by this paper

CITED BY

Enhancing web search result clustering model based on multiview multirepresentation consensus cluster ensemble (mmcc) approach
2021cites this paper
A HYBRID APPROACH FOR WEB SEARCH RESULT CLUSTERING BASED ON GENETIC ALGORITHM WITH K-MEANS
2021cites this paper
Enhanced clustering models with wiki-based k-nearest neighbors-based representation for web search result clustering
2020cites this paper
Ensemble machine learning approaches for webshell detection in Internet of things environments
2020cites this paper