12-09-2017 08:58 PM
#1
12-10-2017 04:49 AM
#2
It might have expected AZ to do something else the first time, or it may have just realised its first response was not optimal. I don't think it was trying to probe AZ for weaknesses. That's theory-of-mind shit and not something computers are capable of, afaik.
12-10-2017 07:47 AM
#3
The reason it's different is that it's constantly learning: it evaluated a move, and once that move had been played it could see further, so when the position was repeated it saw a better option. It might also realise that if it plays move A and the other player doesn't find the exact perfect reply, it's winning, while if the other player does find it, the position just repeats. So playing move A first, before trying the next-best option B, is the better play.
12-10-2017 12:23 PM
#4