Skip to main content

Lund University Publications

LUND UNIVERSITY LIBRARIES

Cancer gene expression data analysis using rough based symmetrical clustering

Sarkar, Anasua LU orcid and Maulik, Ujjwal (2012) p.699-715
Abstract

Identification of cancer subtypes is the central goal in the cancer gene expression data analysis. Modified symmetry-based clustering is an unsupervised learning technique for detecting symmetrical convex or non-convex shaped clusters. To enable fast automatic clustering of cancer tissues (samples), in this chapter, the authors propose a rough set based hybrid approach for modified symmetry-based clustering algorithm. A natural basis for analyzing gene expression data using the symmetry-based algorithm is to group together genes with similar symmetrical patterns of microarray expressions. Rough-set theory helps in faster convergence and initial automatic optimal classification, thereby solving the problem of unknown knowledge of number... (More)

Identification of cancer subtypes is the central goal in the cancer gene expression data analysis. Modified symmetry-based clustering is an unsupervised learning technique for detecting symmetrical convex or non-convex shaped clusters. To enable fast automatic clustering of cancer tissues (samples), in this chapter, the authors propose a rough set based hybrid approach for modified symmetry-based clustering algorithm. A natural basis for analyzing gene expression data using the symmetry-based algorithm is to group together genes with similar symmetrical patterns of microarray expressions. Rough-set theory helps in faster convergence and initial automatic optimal classification, thereby solving the problem of unknown knowledge of number of clusters in gene expression measurement data. For rough-settheoretic decision rule generation, each cluster is classified using heuristically searched optimal reducts to overcome overlapping cluster problem. The rough modified symmetry-based clustering algorithm is compared with another newly implemented rough-improved symmetry-based clustering algorithm and existing K-Means algorithm over five benchmark cancer gene expression data sets, to demonstrate its superiority in terms of validity. The statistical analyses are also performed to establish the significance of this rough modified symmetry-based clustering approach.

(Less)
Please use this url to cite or link to this publication:
author
and
publishing date
type
Chapter in Book/Report/Conference proceeding
publication status
published
host publication
Handbook of Research on Computational Intelligence for Engineering, Science, and Business
pages
699 - 715
publisher
IGI Global
external identifiers
  • scopus:84898504029
ISBN
9781466625198
9781466625181
DOI
10.4018/978-1-4666-2518-1.ch027
language
English
LU publication?
no
id
9e521c44-b3f9-463f-997e-2584a7f237bd
date added to LUP
2018-10-09 09:56:13
date last changed
2024-06-10 19:14:38
@inbook{9e521c44-b3f9-463f-997e-2584a7f237bd,
  abstract     = {{<p>Identification of cancer subtypes is the central goal in the cancer gene expression data analysis. Modified symmetry-based clustering is an unsupervised learning technique for detecting symmetrical convex or non-convex shaped clusters. To enable fast automatic clustering of cancer tissues (samples), in this chapter, the authors propose a rough set based hybrid approach for modified symmetry-based clustering algorithm. A natural basis for analyzing gene expression data using the symmetry-based algorithm is to group together genes with similar symmetrical patterns of microarray expressions. Rough-set theory helps in faster convergence and initial automatic optimal classification, thereby solving the problem of unknown knowledge of number of clusters in gene expression measurement data. For rough-settheoretic decision rule generation, each cluster is classified using heuristically searched optimal reducts to overcome overlapping cluster problem. The rough modified symmetry-based clustering algorithm is compared with another newly implemented rough-improved symmetry-based clustering algorithm and existing K-Means algorithm over five benchmark cancer gene expression data sets, to demonstrate its superiority in terms of validity. The statistical analyses are also performed to establish the significance of this rough modified symmetry-based clustering approach.</p>}},
  author       = {{Sarkar, Anasua and Maulik, Ujjwal}},
  booktitle    = {{Handbook of Research on Computational Intelligence for Engineering, Science, and Business}},
  isbn         = {{9781466625198}},
  language     = {{eng}},
  month        = {{12}},
  pages        = {{699--715}},
  publisher    = {{IGI Global}},
  title        = {{Cancer gene expression data analysis using rough based symmetrical clustering}},
  url          = {{http://dx.doi.org/10.4018/978-1-4666-2518-1.ch027}},
  doi          = {{10.4018/978-1-4666-2518-1.ch027}},
  year         = {{2012}},
}