FedPrune-Sparse

FedPrune-Sparse is a reproducible federated learning simulation for studying the joint effect of client-side model pruning and client-to-server delta sparsification under non-IID data. The project is structured for thesis/dissertation experiments: each run stores configuration, detailed English logs, round-level metrics, model checkpoints, and publication-ready figures under results/.

Languages: English | فارسی

Research Objective

Standard FedAvg can be expensive for resource-constrained clients because every selected client trains the full model and transmits a dense update. FedPrune-Sparse separates these two costs:

Cost Source	Mechanism
Local computation	Prune the client model before local training.
Uplink communication	Sparsify the local model delta before transmission.

This separation supports controlled ablations:

Plain FedAvg.
FedAvg with pruning only.
FedAvg with sparsification only.
Hybrid pruning plus sparsification.

Federated Round Flow

Global model broadcast by server
        │
        ▼
Client-side model copy
        │
        ▼
Optional model pruning
        │
        ▼
Local training on non-IID client data
        │
        ▼
Delta = local weights - global weights
        │
        ▼
Optional delta sparsification with error feedback
        │
        ▼
Compressed delta sent to server
        │
        ▼
FedAvg aggregation and global model update

Project Structure

fedprune-sparse/
├── client/
│   └── client.py
├── data/
│   └── MNIST/
├── models/
│   └── cnn.py
├── scripts/
│   └── compare_ablations.py
├── server/
│   └── server.py
├── utils/
│   ├── gradient_sparsification.py
│   └── model_pruning.py
├── run_simulation.py
├── requirements.txt
├── README.md
└── README.fa.md

Path	Purpose
`models/cnn.py`	Compact MNIST CNN designed for transparent structured pruning.
`utils/model_pruning.py`	Unstructured pruning, structured pruning, and adaptive per-client pruning ratios.
`utils/gradient_sparsification.py`	Top-K, random, and cost-weighted sparsification with error feedback.
`client/client.py`	Client workflow: pruning, local training, delta computation, and sparsification.
`server/server.py`	Global model broadcast, FedAvg aggregation, and evaluation.
`run_simulation.py`	End-to-end experiment runner with logs, metrics, checkpoints, and plots.
`scripts/compare_ablations.py`	Automated high-level ablation suite.

Implemented Methods

Model Pruning

Unstructured pruning: masks low-magnitude weights in Conv2d and Linear layers while preserving tensor shapes.
Structured pruning: removes complete filters or hidden neurons and rebuilds a smaller SimpleCNN, reducing local memory/FLOP cost without requiring sparse tensor backends.
Adaptive pruning: assigns client-specific pruning ratios either from synthetic capability profiles or online round-time feedback.

Delta Sparsification

Top-K: keeps the largest absolute delta coordinates.
Random: keeps random coordinates as a baseline.
Cost-weighted: ranks coordinates by abs(delta) / cost, with higher default costs for fully connected parameters.
Error feedback: stores unsent delta residuals locally and reuses them in future rounds.

Installation

Python 3.10+ is recommended.

python -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt

If you use a CUDA GPU, install the PyTorch build that matches your CUDA version before installing the remaining dependencies.

Quick Start

Run the default dissertation-oriented hybrid experiment:

python run_simulation.py

Run a faster smoke experiment:

python run_simulation.py \
  --rounds 2 \
  --num_clients 4 \
  --clients_per_round 2 \
  --local_epochs 1 \
  --run_name smoke_test

Run a plain FedAvg baseline:

python run_simulation.py \
  --no_pruning \
  --no_sparsification \
  --no_adaptive_ratio \
  --run_name baseline_fedavg

Run the full ablation suite:

python scripts/compare_ablations.py

Results and Logs

Each run creates a subdirectory under results/. If --run_name is provided, it is used as the subdirectory name; otherwise a timestamped name is generated.

Example:

python run_simulation.py --run_name hybrid_structured_cwmp

Output directory:

results/hybrid_structured_cwmp/
├── config.json
├── metrics.csv
├── run.log
├── summary.json
├── final_model.pt
├── accuracy_curve.png
├── compression_pruning_curve.png
├── round_time_curve.png
└── experiment_dashboard.png

File	Description
`config.json`	Full command-line configuration for the run.
`metrics.csv`	Round-by-round accuracy, compression, pruning, and timing metrics.
`run.log`	Detailed English execution log with per-round and per-client statistics.
`summary.json`	Final and best metrics plus artifact paths.
`final_model.pt`	Final global model checkpoint.
`*.png`	High-resolution plots saved at 300 DPI for analysis and reporting.

Key Metrics

Metric	Meaning
`test_accuracy`	Global model accuracy on the MNIST test set.
`avg_transmitted_ratio`	Average fraction of nonzero delta coordinates transmitted by selected clients.
`avg_pruning_ratio`	Average pruning ratio among selected clients.
`avg_round_time_sec`	Average local update time for selected clients.

Important Arguments

Argument	Default	Description
`--rounds`	`20`	Number of federated rounds.
`--num_clients`	`20`	Total simulated clients.
`--clients_per_round`	`10`	Selected clients per round.
`--local_epochs`	`2`	Local epochs per selected client.
`--pruning_mode`	`structured`	`structured` or `unstructured`.
`--pruning_ratio`	`0.4`	Base pruning ratio.
`--adaptive_ratio`	enabled	Enables per-client pruning ratios.
`--no_adaptive_ratio`	disabled	Disables adaptive pruning ratios.
`--sparsify_method`	`cost_weighted`	`topk`, `random`, or `cost_weighted`.
`--sparsity_ratio`	`0.95`	Fraction of delta coordinates to zero before transmission.
`--no_error_feedback`	disabled	Disables residual error feedback.
`--results_dir`	`./results`	Output directory for all artifacts.
`--run_name`	timestamp	Optional run subdirectory name.
`--device`	`cpu`	Training/evaluation device, e.g. `cuda`.

Reproducibility Notes

The random seed controls data shard assignment, client sampling, model initialization, and random sparsification.
config.json records all experiment settings.
metrics.csv is suitable for downstream statistical analysis.
run.log contains detailed English logs for auditing each experiment.
The ablation script uses consistent shared arguments so methods are compared under the same global configuration.

Citation-Oriented Summary

FedPrune-Sparse evaluates a hybrid resource-aware federated learning pipeline in which structured or unstructured pruning reduces local client computation, while sparse delta communication with error feedback reduces uplink bandwidth. The design preserves FedAvg compatibility by expanding structured-pruned local updates back to the global tensor shape before aggregation.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

FedPrune-Sparse

Research Objective

Federated Round Flow

Project Structure

Implemented Methods

Model Pruning

Delta Sparsification

Installation

Quick Start

Results and Logs

Key Metrics

Important Arguments

Reproducibility Notes

Citation-Oriented Summary

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
.vscode		.vscode
client		client
models		models
scripts		scripts
server		server
utils		utils
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE		LICENSE
README.fa.md		README.fa.md
README.md		README.md
requirements.txt		requirements.txt
run_simulation.py		run_simulation.py

Folders and files

Latest commit

History

Repository files navigation

FedPrune-Sparse

Research Objective

Federated Round Flow

Project Structure

Implemented Methods

Model Pruning

Delta Sparsification

Installation

Quick Start

Results and Logs

Key Metrics

Important Arguments

Reproducibility Notes

Citation-Oriented Summary

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages