Change Log

v2.0.0

MLBench Helm

Implemented enhancements:

  • Added GKE and AWS Setup Scripts

MLBench Benchmarks

Implemented enhancements:

  • Added Goals to PyTorch Benchmark

  • Updated PyTorch Tutorial code

  • Changed all images to newest mlbench-core version.

MLBench Dashboard

Implemented enhancements:

  • Added Download of Task Goals

  • Fixed some performance issues

v2.4.0

MLBench Core

v2.4.0 (2020-04-20)

Full Changelog

Implemented enhancements:

  • Switch to black for code formatting #35

Closed issues:

  • Travis tests run only for Python 3.6 #65

  • Downloading results fails if --output option is not provided #57

  • Remember user input in mlbench run #56

  • Aggregate the gradients by model, instead of by layers. #45

  • Update docker images to CUDA10, mlbench-core module to newest #43

  • Upgrade PyTorch to 1.4 #40

Merged pull requests:

v2.3.2

MLBench Core

v2.3.2 (2020-04-07)

Full Changelog

Implemented enhancements:

  • Add NCCL & GLOO Backend support #49

  • Add NCCL & GLOO Backend support #47 (giorgiosav)

Fixed bugs:

  • math ValueError with 1-node cluster #38

Merged pull requests:

v2.3.1

MLBench Core

2.3.1 (2020-03-09)

Full Changelog

Implemented enhancements:

  • Customize Communication Scheme For Sparsified/Quantizatized/Decentralized scenarios #12

v2.3.0

MLBench Core

v2.3.0 (2019-12-23)

Full Changelog

v2.2.1

MLBench Core

v2.2.1 (2019-12-16)

Full Changelog

v2.2.0

MLBench Core

v2.2.0 (2019-11-11)

Full Changelog

Implemented enhancements: - initialize_backends can now be called as context manager - Improved CLI to run multiple runs in parallel

v2.1.1

MLBench Core

v2.1.1 (2019-11-11)

Full Changelog

v2.1.0

MLBench Core

v2.1.0 (2019-11-4)

Full Changelog

Implemented enhancements:

  • Added CLI for MLBench runs

v2.0.0

MLBench Core

v2.0.0 (2019-06-13)

Full Changelog

v1.4.4

MLBench Core

v1.4.4 (2019-05-28)

Full Changelog

v1.4.3

MLBench Core

v1.4.3 (2019-05-23)

Full Changelog

v1.4.2

MLBench Core

v1.4.2 (2019-05-21)

Full Changelog

v1.4.1

MLBench Core

v1.4.1 (2019-05-16)

Full Changelog

v1.4.0

MLBench Core

v1.4.0 (2019-05-02)

Full Changelog

Implemented enhancements:

  • Split Train and Validation in Tensorflow #22

v1.3.4

MLBench Core

v1.3.4 (2019-03-20)

Full Changelog

Implemented enhancements:

  • in controlflow, don’t mix train and validation #20

Fixed bugs:

  • Add metrics logging for Tensorflow #19

v1.3.3

MLBench Core

v1.3.3 (2019-02-26)

Full Changelog

v1.3.2

MLBench Core

v1.3.2 (2019-02-13)

Full Changelog

v1.3.1

MLBench Core

v1.3.1 (2019-02-13)

Full Changelog

v1.3.0

MLBench Core

v1.3.0 (2019-02-12)

Full Changelog

v1.2.1

MLBench Core

v1.2.1 (2019-01-31)

Full Changelog

v1.2.0

MLBench Core

v1.2.0 (2019-01-30)

Full Changelog

v1.1.1

MLBench Core

v1.1.1 (2019-01-09)

Full Changelog

v1.1.0

MLBench Core

v1.1.0 (2018-12-06)

Full Changelog

Fixed bugs:

  • Bug when saving checkpoints #13

Implemented enhancements:

  • Adds Tensorflow Controlflow, Dataset and Model code

  • Adds Pytorch linear models

  • Adds sparsified and decentralized optimizers

MLBench Benchmarks

Implemented enhancements:

  • Added Tensorflow Benchmark

MLBench Dashboard

Implemented enhancements:

  • Added new Tensorflow Benchmark Image

  • Remove Bandwidth limiting

  • Added ability to run custom images in dashboard

MLBench Helm

Nothing

v1.0.0

MLBench Core

1.0.0 (2018-11-15)

Implemented enhancements:

  • Add API Client to mlbench-core #6

  • Move to google-style docs #4

  • Add Imagenet Dataset for pytorch #3

  • Move worker code to mlbench-core repo #1

v0.1.0

Main Repo

0.1.0 (2018-09-14)

Implemented enhancements:

  • Add documentation in reference implementation to docs #46

  • Replace cAdvisor with Kubernetes stats for Resource usage #38

  • Rename folders #31

  • Change docker image names #30

  • Add continuous output for mpirun #27

  • Replace SQlite with Postgres #25

  • Fix unittest #23

  • Add/Fix CI/Automated build #22

  • Cleanup unneeded project files #21

  • Remove hardcoded values #20

  • Improves Notes.txt #19

  • Rename components #15

Fixed bugs:

  • 504 Error when downloading metrics for long runs #61

Closed issues:

  • small doc improvements for first release #54

  • Check mlbench works on Google Cloud #51

  • learning rate scheduler #50

  • Add Nvidia k8s-device-plugin to charts #48

  • Add Weave to Helm Chart #41

  • Allow limiting of resources for experiments #39

  • Allow downloading of Run measurements #35

  • Worker Details page #33

  • Run Visualizations #32

  • Show experiment history in Dashboard #18

  • Show model progress in Dashboard #13

  • Report cluster status in Dashboard #12

  • Send metrics from SGD example to metrics api #11

  • Add metrics endpoint for experiments #10

  • Let Coordinator Dashboard start a distributed Experiment #9

  • Add mini-batch SGD model experiment #8

* This Change Log was automatically generated by `github_changelog_generator <https://github.com/skywinder/Github-Changelog-Generator>`__