data-mining

Here are 5,160 public repositories matching this topic...

This project compares traditional machine learning methods for tabular data classification (ensemble methods, decision trees, and Naive Bayes) with NLP classification methods (Multinomial Naive Bayes, RNNs, and Transformers). It uses survey data from the CDC's Behavioral Risk Factor Surveillance System (BRFSS).

  • Updated Jun 11, 2024
  • Jupyter Notebook

A lightweight decision tree framework for Python. It supports regular algorithms (ID3, C4.5, CART, CHAID, and regression trees) and some advanced techniques (gradient boosting, random forest, and AdaBoost), with support for categorical features.

  • Updated Jun 11, 2024
  • Python
