Fair Algorithms for Clustering

Suman Kalyan Bera, Deeparnab Chakrabarty, Maryam Negahbani

Published 2019 in Neural Information Processing Systems

ABSTRACT

We study the problem of finding low-cost Fair Clusterings in data where each data point may belong to many protected groups. Our work significantly generalizes the seminal work of Chierichetti et al. (NIPS 2017) as follows.

- We allow the user to specify the parameters that define fair representation. More precisely, these parameters define the maximum over- and minimum under-representation of any group in any cluster.
- Our clustering algorithm works on any $\ell_p$-norm objective (e.g. $k$-means, $k$-median, and $k$-center). Indeed, our algorithm transforms any vanilla clustering solution into a fair one incurring only a slight loss in quality.
- Our algorithm also allows individuals to lie in multiple protected groups. In other words, we do not need the protected groups to partition the data and we can maintain fairness across different groups simultaneously.

Our experiments show that on established data sets, our algorithm performs much better in practice than what our theoretical results suggest.
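
To make the fairness parameters in the abstract concrete, here is a minimal sketch that only checks whether a given clustering respects per-group upper and lower representation bounds. It is not the authors' algorithm, which transforms a vanilla clustering into a fair one. The function name `fairness_violations`, the dictionaries `upper` and `lower`, and the toy data are illustrative assumptions and do not come from the paper. Points may belong to several groups at once, matching the overlapping-groups setting described above.

```python
from collections import defaultdict


def fairness_violations(assignment, memberships, upper, lower):
    """Return (cluster, group, fraction) triples that break the bounds.

    assignment:  list mapping point index -> cluster id
    memberships: list mapping point index -> set of group names (may overlap)
    upper/lower: dicts mapping group name -> max / min allowed fraction
                 of that group inside any single cluster (hypothetical names)
    """
    cluster_sizes = defaultdict(int)
    group_counts = defaultdict(int)  # keyed by (cluster, group)
    for point, cluster in enumerate(assignment):
        cluster_sizes[cluster] += 1
        for group in memberships[point]:
            group_counts[(cluster, group)] += 1

    violations = []
    for cluster, size in cluster_sizes.items():
        for group in upper:
            fraction = group_counts[(cluster, group)] / size
            # Over-representation or under-representation of this group.
            if fraction > upper[group] or fraction < lower[group]:
                violations.append((cluster, group, fraction))
    return violations


# Toy data: six points, two clusters, two overlapping groups "a" and "b".
assignment = [0, 0, 0, 1, 1, 1]
memberships = [{"a"}, {"a", "b"}, {"b"}, {"a"}, {"a"}, {"a", "b"}]
upper = {"a": 0.7, "b": 0.7}
lower = {"a": 0.3, "b": 0.3}
violations = fairness_violations(assignment, memberships, upper, lower)
```

In this toy instance, cluster 1 consists entirely of members of group "a", so it exceeds the assumed upper bound of 0.7 and is reported. Cluster 0 satisfies both bounds for both groups.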

PUBLICATION RECORD

Publication year
2019
Venue
Neural Information Processing Systems
Publication date
2019-01-08
Fields of study
Mathematics, Computer Science
Identifiers
arXiv 1901.02393

REFERENCES

The (Im)possibility of fairness
2021, cited by this paper
Fair k-Center Clustering for Data Summarization
2019, cited by this paper
Proportionally Fair Clustering
2019, cited by this paper
Tight FPT Approximations for $k$-Median and $k$-Means
2019, cited by this paper
Matroids, Matchings, and Fairness
2019, cited by this paper
Guarantees for Spectral Clustering with Fairness Constraints
2019, cited by this paper
Scalable Fair Clustering
2019, influential reference
Fair Clustering Through Fairlets
2018, influential reference
Privacy preserving clustering with constraints
2018, influential reference
On the cost of essentially fair clusterings
2018, cited by this paper
Generalized Center Problems with Outliers
2018, cited by this paper
Classification with Fairness Constraints: A Meta-Algorithm with Provable Guarantees
2018, cited by this paper
Constant factor FPT approximation for capacitated k-median
2018, cited by this paper
Fair Coresets and Streaming Algorithms for Fair k-Means Clustering
2018, cited by this paper
The accuracy, fairness, and limits of predicting recidivism
2018, cited by this paper
Ranking with Fairness Constraints
2017, influential reference
Multiwinner Voting with Fairness Constraints
2017, influential reference
Inherent Trade-Offs in the Fair Determination of Risk Scores
2016, cited by this paper
Fairness in Learning: Classic and Contextual Bandits
2016, cited by this paper
Measuring Fairness in Ranked Outputs
2016, cited by this paper
Fair Prediction with Disparate Impact: A Study of Bias in Recidivism Prediction Instruments
2016, cited by this paper
Better Guarantees for k-Means and Euclidean k-Median by Primal-Dual Algorithms
2016, cited by this paper
Approximation Algorithms for Clustering Problems with Lower Bounds and Outliers
2016, cited by this paper
False Positives, False Negatives, and False Analyses: A Rejoinder to "Machine Bias: There's Software Used across the Country to Predict Future Criminals. And It's Biased against Blacks"
2016, cited by this paper
Faster Algorithms for the Constrained k-means Problem
2015, cited by this paper
Fairness Constraints: Mechanisms for Fair Classification
2015, cited by this paper
A Unified Framework for Clustering Constrained Data Without Locality Property
2015, cited by this paper
Impact of HbA1c Measurement on Hospital Readmission Rates: Analysis of 70,000 Clinical Database Patient Records
2014, cited by this paper
An Improved Approximation for k-Median and Positive Correlation in Budgeted Optimization
2014, influential reference
A data-driven approach to predict the success of bank telemarketing
2014, cited by this paper
Certifying and Removing Disparate Impact
2014, influential reference
Improved Approximation Algorithms for Matroid and Knapsack Median Problems and Applications
2013, cited by this paper
Learning Fair Representations
2013, cited by this paper
Machine learning for targeted display advertising: transfer learning in action
2013, cited by this paper
Approximating k-median via pseudo-approximation
2012, influential reference
Fairness-Aware Classifier with Prejudice Remover Regularizer
2012, cited by this paper
Data Clustering: Algorithms and Applications
2012, cited by this paper
Improved Approximation Guarantees for Lower-Bounded Facility Location
2011, cited by this paper
k-NN as an implementation of situation testing for discrimination discovery and prevention
2011, cited by this paper
Scikit-learn: Machine Learning in Python
2011, cited by this paper
Fairness through awareness
2011, cited by this paper
Three naive Bayes approaches for discrimination-free classification
2010, cited by this paper
Consumer Credit Risk Models Via Machine-Learning Algorithms
2010, cited by this paper
The comparisons of data mining techniques for the predictive accuracy of probability of default of credit card clients
2009, cited by this paper
Lower-bounded facility location
2008, cited by this paper
Simpler Analyses of Local Search Algorithms for Facility Location
2008, cited by this paper
Degree bounded matroids and submodular flows
2008, influential reference
k-means++: the advantages of careful seeding
2007, cited by this paper
Uniform Guidelines on Employee Selection Procedures
2007, cited by this paper
Achieving anonymity via clustering
2006, influential reference
Local Search Heuristics for k-Median and Facility Location Problems
2004, cited by this paper
Evaluating Consumer Loans using Neural Networks
2003, cited by this paper
The Learning-Curve Sampling Method Applied to Model-Based Clustering
2002, cited by this paper
Approximation algorithms for metric facility location and k-Median problems using the primal-dual schema and Lagrangian relaxation
2001, cited by this paper
The Capacitated K-Center Problem
2000, cited by this paper
A constant-factor approximation algorithm for the k-median problem (extended abstract)
1999, cited by this paper
Improved combinatorial algorithms for the facility location and k-median problems
1999, cited by this paper
Applied Psychology in Human Resource Management
1998, cited by this paper
Scaling Up the Accuracy of Naive-Bayes Classifiers: A Decision-Tree Hybrid
1996, cited by this paper
An approximation algorithm for the generalized assignment problem
1993, cited by this paper
Clustering to Minimize the Maximum Intercluster Distance
1985, cited by this paper
A Best Possible Heuristic for the k-Center Problem
1985, cited by this paper
Easy and hard bottleneck location problems
1979, cited by this paper

CITED BY

Fair principal component analysis via eigenvalue optimization
2026, cites this paper
A Generic Framework for Fair Consensus Clustering in Streams
2026, cites this paper
Fair Model-based Clustering
2026, cites this paper
Fair and Skill-Diverse Student Group Formation: A graph-theoretic approach [Special Issue on Artificial Intelligence for Education: A Signal Processing Perspective]
2026, cites this paper
Graph pre-processing method for fairness in spectral clustering
2026, cites this paper
Uncovering algorithmic inequity: a conditional mutual information framework for detecting and mitigating hidden discrimination
2026, cites this paper
Unifying Proportional Fairness in Centroid and Non-Centroid Clustering
2026, cites this paper
Equitable Representation of Demographic Groups in Constrained Spectral Clustering
2025, cites this paper
SoK: Fair Clustering: Critique, Caveats, and Future Directions
2025, cites this paper
Approximation algorithm for prize-collecting weighted set cover with fairness constraints
2025, cites this paper
Fair Clustering with Clusterlets
2025, cites this paper
Relative Error Fair Clustering in the Weak-Strong Oracle Model
2025, cites this paper
Improved Rank Aggregation under Fairness Constraint
2025, cites this paper
FPT Constant Approximation Algorithms for Colorful Sum of Radii
2025, influential citation
Group Fair Matchings using Convex Cost Functions
2025, influential citation
EL-Clustering: Combining Upper- and Lower-Bounded Clusterings for Equitable Load Constraints
2025, cites this paper
Improving Fairness in Density Peak Clustering through Fair Constraints and Multi-Objective Optimization Allocation
2025, cites this paper
Can geodemographic clustering be fair? Incorporating social fairness in crisp and fuzzy approaches through a unified framework
2025, cites this paper
A fair spectral clustering with weighted fairness constraints
2025, cites this paper
A Fair Ensemble Clustering Method
2025, cites this paper
Polynomial-Time Constant-Approximation for Fair Sum-of-Radii Clustering
2025, cites this paper
Parameterized Approximation Algorithm for Doubly Constrained Fair Clustering
2025, cites this paper
Privacy and Fairness in Machine Learning: A Survey
2025, cites this paper
Welfare-Centric Clustering
2025, influential citation
Fair Clustering via Alignment
2025, influential citation
Fair Bayesian Model-Based Clustering
2025, influential citation
Fair Clustering in the Sliding Window Model
2025, cites this paper
Hidden Convexity of Fair PCA and Fast Solver via Eigenvalue Optimization
2025, cites this paper
Towards Fair Decision Boundaries in Clustering: Integrating Disparate Impact Criteria into Maximum Margin Clustering
2025, cites this paper
Accelerating Spectral Clustering under Fairness Constraints
2025, influential citation
Airports and railways with unsplittable demand
2025, cites this paper
Fairness in constrained spectral clustering
2025, cites this paper
Fair Laplace: A unified framework for fair spectral clustering
2025, cites this paper
Improved FPT Approximation for Sum of Radii Clustering with Mergeable Constraints
2025, cites this paper
Euclidean k-center Fair Clusterings
2025, cites this paper
MOUFLON: Multi-group Modularity-based Fairness-aware Community Detection
2025, cites this paper
A Fair Label Propagation Community Detection Algorithm
2025, cites this paper
Modularity-Fair Deep Community Detection
2025, cites this paper
Fair Network Communities through Group Modularity
2025, cites this paper
Recovering Fairness Directly from Modularity: a New Way for Fair Community Partitioning
2025, cites this paper
Fairness-aware PageRank via Edge Reweighting
2025, cites this paper
Large-Scale Fairness Spectral Clustering Based on Learnable Subspace for Elevator Safety Management
2025, cites this paper
A Computational Approach to Improving Fairness in K-means Clustering
2025, cites this paper
Ensuring Fairness in Spectral Clustering via Disparate Impact-Based Graph Construction
2025, cites this paper
Fair-Count-Min: Frequency Estimation under Equal Group-wise Approximation Factor
2025, cites this paper
Near-feasible Fair Allocations in Two-sided Markets
2025, cites this paper
Incorporating Fairness in Neighborhood Graphs for Fair Spectral Clustering
2025, cites this paper
FACROC: a fairness measure for FAir Clustering through ROC curves
2025, cites this paper
Robust self-supervised machine learning for single cell embeddings and annotations
2025, cites this paper
A dual Laplacian framework with effective graph learning for unified fair spectral clustering
2024, cites this paper
Fair Clustering: Critique, Caveats, and Future Directions
2024, influential citation
Individual Fairness under Group Fairness Constraints in Bipartite Matching - One Framework to Approximate Them All
2024, cites this paper
Robust Fair Clustering with Group Membership Uncertainty Sets
2024, influential citation
Reproducibility study of "Robust Fair Clustering: A Novel Fairness Attack and Defense Framework"
2024, cites this paper
Efficient k-means with Individual Fairness via Exponential Tilting
2024, cites this paper
A Polynomial-Time Approximation for Pairwise Fair k-Median Clustering
2024, influential citation
DFMVC: Deep Fair Multi-view Clustering
2024, cites this paper
Individual Fairness in Graph Decomposition
2024, cites this paper
Group Fairness and Multi-Criteria Optimization in School Assignment
2024, cites this paper
Perceptual Fairness in Image Restoration
2024, cites this paper
Fast and Accurate Fair k-Center Clustering in Doubling Metrics
2024, cites this paper
Fair Federated Data Clustering through Personalization: Bridging the Gap between Diverse Data Distributions
2024, cites this paper
Fair Clustering with Minimum Representation Constraints
2024, influential citation
Fair Projections as a Means toward Balanced Recommendations
2024, influential citation
From Discrete to Continuous: Deep Fair Clustering With Transferable Representations
2024, cites this paper
Scalable Algorithms for Individual Preference Stable Clustering
2024, cites this paper
Balanced Fair K-Means Clustering
2024, influential citation
A Scalable Algorithm for Individually Fair K-means Clustering
2024, cites this paper
Approximate Algorithms For k-Sparse Wasserstein Barycenter With Outliers
2024, cites this paper
A Gibbs Posterior Framework for Fair Clustering
2024, cites this paper
Fair Soft Clustering
2024, cites this paper
Parameterized Approximation Schemes for Fair-Range Clustering
2024, cites this paper
Proportionally Fair Matching via Randomized Rounding
2024, cites this paper
Fair Kernel K-Means: from Single Kernel to Multiple Kernel
2024, influential citation
FairHash: A Fair and Memory/Time-efficient Hashmap
2024, cites this paper
Fair Summarization: Bridging Quality and Diversity in Extractive Summaries
2024, cites this paper
On Socially Fair Low-Rank Approximation and Column Subset Selection
2024, cites this paper
Relax and Merge: A Simple Yet Effective Framework for Solving Fair k-Means and k-sparse Wasserstein Barycenter Problems
2024, influential citation
Ensemble clustering via dual self-enhancement by alternating denoising and topological consistency propagation
2024, cites this paper
A Semidefinite Relaxation Approach for Fair Graph Clustering
2024, influential citation
One-Stage Fair Multi-View Spectral Clustering
2024, cites this paper
Oh the Prices You’ll See: Designing a Fair Exchange System to Mitigate Personalized Pricing
2024, cites this paper
Local Causal Discovery with Background Knowledge
2024, cites this paper
Fair k-center Clustering with Outliers
2024, cites this paper
The Fairness-Quality Trade-off in Clustering
2024, influential citation
Faster Approximation Schemes for (Constrained) k-Means with Outliers
2024, cites this paper
FPT Approximations for Capacitated/Fair Clustering with Outliers
2023, cites this paper
Clustering What Matters in Constrained Settings
2023, cites this paper
Facility Relocation Search For Good: When Facility Exposure Meets User Convenience
2023, cites this paper
Proportionally Representative Clustering
2023, cites this paper
Fair Facility Location for Socially Equitable Representation
2023, cites this paper
Parameterized Approximation Schemes for Clustering with General Norm Objectives
2023, cites this paper
Fair Correlation Clustering in Forests
2023, cites this paper
Achieving Long-term Fairness in Submodular Maximization through Randomization
2023, cites this paper
Fair k-Center: a Coreset Approach in Low Dimensions
2023, cites this paper
Improved Coresets for Clustering with Capacity and Fairness Constraints
2023, influential citation
Proportionally Fair Matching with Multiple Groups
2023, cites this paper
A review of clustering models in educational data science towards fairness-aware learning
2023, influential citation
Fair and skill-diverse student group formation via constrained k-way graph partitioning
2023, cites this paper
Fairness-Aware Clique-Preserving Spectral Clustering of Temporal Graphs
2023, cites this paper