Research Topics List

CSC424/334 - Advanced Data Analysis

Following is a list of topics that you may want to consider for your in-class presentation and research paper. You are not limited to topics from this list. You may present on any topic directly related to Multivariate Data Analysis.

Remember that it is important to start thinking about a topic now. Additional details about schedule and format have been posted (see the Presentation/Research page). Remember that the first presentations will be scheduled on week 9. Also, you may work in groups of no more than two individuals (see last item on Presentation/Research page).
Note: Where possible, the papers below were either distributed in class and/or links to the papers in digital libraries have been provided. These libraries usually restrict access to subscribers but in some cases they may be accessed without charge through any workstation on the DePaul network including DPO subscribers.

Topics:

  1. Multiple Regression:
    There are several possibilities. See the Software Engineering survey paper by Fenton that was distributed and discussed in class. This paper has several good references. This is also a topic where you may want to do an implementation. For example, you may implement an IML module to do model selection. For a model selection algorithm that you could try implementing, see the "Predicting Fault Detection Effectiveness" paper by Joseph Morgan et al., Proceedings of the Fourth International Software Metrics Symposium (1997).

  2. Principal Component & Factor Analysis:
    There are several possibilities. See the papers handed out and discussed in class. This is a topic where you may want to analyze data that you are interested in studying. For example, you could try doing an analysis similar to that discussed in the "Somebody Wrote Shakespeare" paper by Ware Myers, IEEE Expert (1990). Also, this is a topic that lends itself to comparisons with other methods, a discussion of related methods, or an implementation using this method. Other papers of interest are:

  3. Linear Discriminant Analysis:
    There are several possibilities. See the papers handed out and discussed in class. This is a topic where you may want to analyze data that you are interested in studying. Also, this is a topic that lends itself to comparisons with other methods or a discussion of related methods.

  4. Cluster Analysis:
    There are several possibilities. See the papers handed out and discussed in class for ideas. This is a topic where you may want to analyze data that you are interested in studying. Also, this is a topic that lends itself to comparisons of various clustering algorithms. For an interesting application of hierarchical clustering, see the "Interactively Exploring Hierarchical Clustering Results" paper by Jinwook Seo et al., IEEE Computer (2002). Also, see the "An Efficient k-Means Clustering Algorithm: Analysis & Implementation" paper by Tapas Kanungo et al., IEEE Transactions on Pattern Analysis & Machine Intelligence (2002).
Note: the following journals are good sources of ideas. These journals are all available through the DePaul electronic journal library. Use the journal name as the search string.