Selection strategies (3) with noise

I re-ran all experiments again, but with a certain noise level to the action.
The noise has been added to all dimensions of the action with a new random number.
The random number comes from a standard deviation with sigma=1 and mean=0 and has an interval of [-1;1].
A noise level of 5% means that the highest reachable noise value is equal to -5% or +5% of the possible action range.

Example:
In the SinusEnvironment the action range is equal to [0;3]. 5% of this range is 0.15 which is multiplied with a random number and than added to the action value. The action=1.0 could therefore be change to an set of values that lies in the interval [0.85; 1.15].
( actionValue = rangeLength * noiseLevel * randomNumber + realActionValue )

noise level: 5%, 10% and 25%

Sinus 5%:

Name reward splits steps samples
greedyTuctF 0.6035023062423907 18.802 2.0 1000.0
greedyFuctF 0.6040585838460535 18.787 2.0 1000.0
greedyFuctT 0.5979691038489539 15.911 2.0 1000.0
greedyTuctT 0.59825262100435 15.88 2.0 1000.0

Sinus 10%:

Name reward splits steps samples
greedyTuctF 0.6066269988505658 8.818 2.0 1000.0
greedyFuctF 0.6067923388366762 8.822 2.0 1000.0
greedyFuctT 0.6062219667448856 8.838 2.0 1000.0
greedyTuctT 0.6062244257106849 8.836 2.0 1000.0

Sinus 25%:

Name reward splits steps samples
greedyTuctF 0.5939506761968181 6.226 2.0 1000.0
greedyFuctF 0.5940263413928148 6.23 2.0 1000.0
greedyFuctT 0.5930231757837592 6.28 2.0 1000.0
greedyTuctT 0.5928948564852644 6.281 2.0 1000.0

Sixhumpcamelback 5%:

Name reward splits steps samples
greedyTuctF 0.7091581425244804 27.924 2.0 1000.0
greedyFuctF 0.7124759285110971 27.943 2.0 1000.0
greedyFuctT 0.7095909070580521 27.728 2.0 1000.0
greedyTuctT 0.7138356501297819 27.598 2.0 1000.0

Sixhumpcamelback 10%:

Name reward splits steps samples
greedyTuctF 0.006789875018259652 24.004 2.0 1000.0
greedyFuctF 0.038062705281156656 24.023 2.0 1000.0
greedyFuctT 0.04887399195052821 23.991 2.0 1000.0
greedyTuctT 0.05299734373558295 23.98 2.0 1000.0

Sixhumpcamelback 25%:

Name reward splits steps samples
greedyTuctF -2.6071878023028106 23.307 2.0 1000.0
greedyFuctF -2.567087832987661 23.266 2.0 1000.0
greedyFuctT -2.464286323408397 23.357 2.0 1000.0
greedyTuctT -2.5385730999717024 23.335 2.0 1000.0

DonutWorld 5%:

Name reward splits steps samples
greedyTuctF 2.581416127418022 127.288 3.0 1000.0
greedyFuctF 2.5779135800014794 127.256 3.0 1000.0
greedyFuctT 2.601451485622715 121.695 3.0 1000.0
greedyTuctT 2.6013879848545107 121.661 3.0 1000.0

DonutWorld 10%:

Name reward splits steps samples
greedyTuctF 2.2842186987644393 98.103 3.0 1000.0
greedyFuctF 2.2804123932855713 98.188 3.0 1000.0
greedyFuctT 2.284743097292101 92.289 3.0 1000.0
greedyTuctT 2.289177410042218 92.123 3.0 1000.0

DonutWorld 25%:

Name reward splits steps samples
greedyTuctF 1.4885869776709448 65.289 3.0 1000.0
greedyFuctF 1.4879622238233174 65.183 3.0 1000.0
greedyFuctT 1.5125176928198325 57.452 3.0 1000.0
greedyTuctT 1.511118881469104 57.406 3.0 1000.0
Advertisements
This entry was posted in Thesis Progress. Bookmark the permalink.

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s