# Introduction

Authors' affiliations: New York University and Facebook AI Research

• Proposes CommNet, which enables cooperating agents to learn to communicate through a continuous channel before taking an action.
• According to the experiments, agents prefer not to communicate with the others unless necessary.
• Tested on the Lever Pulling task and the Traffic Junction environment.
• paper
• code

# Approach

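Setting: partial observability with a global, continuous communication channel. The state input is $s=[s_1,s_2,...,s_J]$, where $s_j$ is the view of agent $j$ among $J$ agents, and what we need to learn is the mapping $a=\phi(s)$ from the state to the agents' actions.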
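To make the setting concrete, here is a minimal sketch of one communication step as I understand it from the paper: each agent $j$ keeps a hidden vector $h_j$, receives a communication vector $c_j$ that is the mean of the other agents' hidden vectors, and updates $h_j \leftarrow \tanh(H h_j + C c_j)$ with weights shared by all agents. The function names, the `tanh` nonlinearity, and the toy identity weights are assumptions for illustration, not the authors' code.

```python
import math

def matvec(m, v):
    """Multiply matrix m (a list of rows) by vector v."""
    return [sum(w * x for w, x in zip(row, v)) for row in m]

def comm_vectors(hidden):
    """c_j = mean of the OTHER agents' hidden vectors (zeros if there is one agent)."""
    n, dim = len(hidden), len(hidden[0])
    if n == 1:
        return [[0.0] * dim]
    total = [sum(h[k] for h in hidden) for k in range(dim)]
    return [[(total[k] - h[k]) / (n - 1) for k in range(dim)] for h in hidden]

def comm_step(hidden, H, C):
    """One step h_j <- tanh(H h_j + C c_j); H and C are shared across agents."""
    comm = comm_vectors(hidden)
    return [
        [math.tanh(a + b) for a, b in zip(matvec(H, h), matvec(C, c))]
        for h, c in zip(hidden, comm)
    ]

# Toy usage: 3 agents, 2-dimensional hidden states, identity weights (assumed values).
h0 = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
identity = [[1.0, 0.0], [0.0, 1.0]]
h1 = comm_step(h0, identity, identity)
```

Because the weights are shared and the channel is an average, the same function works for any number of agents, which is what lets the agent count vary between episodes.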

The parameter update $\delta \theta$ is computed with a policy gradient that uses a baseline.
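A minimal sketch of what a policy gradient with a baseline computes, assuming an undiscounted reward-to-go and a squared-error term that trains the baseline. The function names and the weight `alpha` are hypothetical; in a real implementation the log-probabilities and baselines would come from the network, and an autodiff library would differentiate the loss to obtain $\delta \theta$.

```python
def rewards_to_go(rewards):
    """Undiscounted return from each step t to the end of the episode."""
    out, running = [], 0.0
    for r in reversed(rewards):
        running += r
        out.append(running)
    return out[::-1]

def advantages(rewards, baselines):
    """Return-to-go minus the baseline prediction at each step."""
    return [g - b for g, b in zip(rewards_to_go(rewards), baselines)]

def surrogate_loss(log_probs, rewards, baselines, alpha=0.1):
    """Scalar loss to minimise; alpha is an assumed weighting, not a tuned value.

    policy term:   -sum_t log p(a_t | s_t) * advantage_t
                   (the advantage is treated as a constant here)
    baseline term: alpha * sum_t advantage_t ** 2, pulling the baseline
                   towards the observed return
    """
    adv = advantages(rewards, baselines)
    policy_term = -sum(lp * a for lp, a in zip(log_probs, adv))
    baseline_term = alpha * sum(a * a for a in adv)
    return policy_term + baseline_term

# Toy episode of three steps (made-up numbers).
episode_rewards = [1.0, 0.0, 2.0]
episode_baselines = [1.0, 1.0, 1.0]
episode_adv = advantages(episode_rewards, episode_baselines)
```

Subtracting the baseline does not change the expected gradient but reduces its variance, which is why it is used here.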

## Extensions

### Local Connectivity

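The global communication can be restricted to local communication by changing how $c_j$ is computed: each agent averages only over the agents within a certain range of it instead of over all other agents.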
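A minimal sketch of the local variant, assuming $c_j$ is the mean of the hidden vectors of the agents in a neighbourhood $N(j)$. The Euclidean-radius rule in `neighbors_within` is a hypothetical choice of neighbourhood for illustration; the environment could define it differently.

```python
def neighbors_within(positions, radius):
    """Hypothetical neighbourhood rule: other agents within Euclidean `radius`."""
    result = []
    for j, (xj, yj) in enumerate(positions):
        result.append([
            i for i, (xi, yi) in enumerate(positions)
            if i != j and (xi - xj) ** 2 + (yi - yj) ** 2 <= radius ** 2
        ])
    return result

def local_comm_vectors(hidden, neighbors):
    """c_j = mean of the hidden vectors of agents in neighbors[j] (zeros if none)."""
    dim = len(hidden[0])
    out = []
    for nbrs in neighbors:
        if not nbrs:
            out.append([0.0] * dim)  # an isolated agent receives no message
            continue
        out.append([sum(hidden[i][k] for i in nbrs) / len(nbrs) for k in range(dim)])
    return out

# Toy usage: agents 0 and 1 are close, agent 2 is far away.
positions = [(0.0, 0.0), (1.0, 0.0), (5.0, 5.0)]
hidden = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
nbrs = neighbors_within(positions, radius=2.0)
comm = local_comm_vectors(hidden, nbrs)
```

Since the neighbourhoods are recomputed from the current positions, the communication graph can change at every step as agents move.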

### Skip Connections

Because $h_j^0$ contains information about the agent's own observation, in some cases it helps to feed it into every communication step.
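Written out, and assuming $f^i$ denotes the module applied at communication step $i$ and $c_j^i$ the communication vector agent $j$ receives at that step, the skip connection simply adds $h_j^0$ as an extra input:

$$h_j^{i+1} = f^i\left(h_j^i,\; c_j^i,\; h_j^0\right)$$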

### Temporal Recurrence

The MLP used before can be replaced by an RNN or LSTM so that the agents can remember important information across time steps.
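A minimal sketch of the recurrent idea, assuming a plain tanh RNN cell rather than an LSTM: the same weights are reused at every time step $t$, and each agent's hidden state is carried forward. The update $h_j^{t+1} = \tanh(H h_j^t + C c_j^t + E s_j^t)$, the input matrix `E`, and all names are assumptions for illustration and may differ from the paper's exact formulation.

```python
import math

def matvec(m, v):
    """Multiply matrix m (a list of rows) by vector v."""
    return [sum(w * x for w, x in zip(row, v)) for row in m]

def mean_of_others(hidden):
    """c_j = mean of the other agents' hidden vectors (zeros if there is one agent)."""
    n, dim = len(hidden), len(hidden[0])
    if n == 1:
        return [[0.0] * dim]
    total = [sum(h[k] for h in hidden) for k in range(dim)]
    return [[(total[k] - h[k]) / (n - 1) for k in range(dim)] for h in hidden]

def recurrent_rollout(observations, H, C, E, dim):
    """Unroll over time with shared weights; observations[t][j] is agent j's input at t."""
    n_agents = len(observations[0])
    hidden = [[0.0] * dim for _ in range(n_agents)]  # zero initial state (assumed)
    history = []
    for obs_t in observations:
        comm = mean_of_others(hidden)
        hidden = [
            [math.tanh(a + b + c)
             for a, b, c in zip(matvec(H, h), matvec(C, cj), matvec(E, s))]
            for h, cj, s in zip(hidden, comm, obs_t)
        ]
        history.append(hidden)
    return history

# Toy usage: 2 agents, 1-dimensional states, unit weights; only agent 0 sees a signal at t=0.
one = [[1.0]]
obs = [[[1.0], [0.0]], [[0.0], [0.0]]]
history = recurrent_rollout(obs, one, one, one, dim=1)
```

In this toy run agent 1 never observes the signal itself, yet its hidden state at the second step is non-zero because agent 0's state reached it through the communication channel.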

# Experiments

I won't go through every experiment listed in the paper, but there are some important results in it.

• CommNet with an LSTM performs better than the other models compared
• A continuous channel can do better than a discrete channel

# Notes

• A single network with a communication channel at each layer is not easy to scale up