Difference between revisions of "PIC32MX: Benchmarking Mathematical Operations"
(New page: == Original Assignment == '''Do not erase this section!''' Your assignment is to == Overview == Summarize briefly what the page is about. == Circuit == Include a schematic and give a...) |
NickMarchuk (talk | contribs) |
||
(295 intermediate revisions by 4 users not shown) | |||
Line 1: | Line 1: | ||
== |
== Overview == |
||
We were tasked with determining the real-time cost (measured in nanoseconds) of performing seven basic mathematical operations with each one of the six commonly used ANSI C data types. |
|||
'''Do not erase this section!''' |
|||
The mathematical operations we tested were: |
|||
Your assignment is to |
|||
* subtraction |
|||
* addition |
|||
* multiplication |
|||
* division |
|||
* square root |
|||
* sine |
|||
* cosine |
|||
The six data types we tested each operation on were: |
|||
== Overview == |
|||
* char |
|||
* short |
|||
* integer |
|||
* long long |
|||
* float |
|||
* double |
|||
Our testing procedure was simple: throw an output pin high on the NU32 development board, perform a mathematical operation with a given data type, and then pull the same pin low. |
|||
Summarize briefly what the page is about. |
|||
Placing the above three steps in an infinite while loop afforded us the opportunity to use an oscilloscope to measure the duration between each high-low pair in the output waveform. After subtracting the time it took for the PIC to raise and lower the voltage on the output pin (something we previously measured), we were able to determine the amount of time required for the PIC32 chip to execute an operation with a high level of accuracy. |
|||
== Circuit == |
|||
With seven operations to perform on six different data types, we created the following table to help us assign and keep track of the various tests we planned to run: |
|||
Include a schematic and give any part numbers. A photo of your circuit is OK, but not as a replacement for a schematic. |
|||
{| class="wikitable" style="text-align:right;" border="0" CELLSPACING = "10" FRAME = "LEFT" |
|||
|+Operation vs. Data type |
|||
|- |
|||
! !! char (8-bit) !! short (16-bit) !! int (32-bit) !! long long (64-bit) !! float (32-bit) !! double (64-bit) |
|||
|- |
|||
! subtraction |
|||
| Test 2 || Test 9 || Test 16 || Test 23 || Test 30 || Test 37 |
|||
|- |
|||
! addition |
|||
| Test 3 || Test 10 || Test 17 || Test 24 || Test 31 || Test 38 |
|||
|- |
|||
! multiplication |
|||
| Test 4 || Test 11 || Test 18 || Test 25 || Test 32 || Test 39 |
|||
|- |
|||
! division |
|||
| Test 5 || Test 12 || Test 19 || Test 26 || Test 33 || Test 40 |
|||
|- |
|||
! square root |
|||
| Test 6 || Test 13 || Test 20 || Test 27 || Test 34 || Test 41 |
|||
|- |
|||
! sine |
|||
| Test 7 || Test 14 || Test 21 || Test 28 || Test 35 || Test 42 |
|||
|- |
|||
! cosine |
|||
| Test 8 || Test 15 || Test 22 || Test 29 || Test 36 || Test 43 |
|||
|} |
|||
Several tests contained multiple procedures that explored various ways to carry out a given mathematical operation on a given data type. For example, in the multiplication tests, not only did we test the traditional multiplication operator (*), but also the bitwise left shift operator (<<). Our goal was to find out if one particular operator was faster than the other. Similarly, we also included procedures that performed the above operations on hard-coded numbers (such as 347) as well as randomly chosen numbers stored in variables (such as 'random_int1'). We wanted to ensure that the compiler didn't compute the results of each operation in advance. While pre-compiling can indeed afford welcome increases in execution time, situations in which the compiler can't optimize the operations ahead of time (for example, situations where the data to be operated on is not known in advance) are still common occurrences and are worth benchmarking. |
|||
Accordingly, several tests contain multiple procedures that not only account for multiple methods of performing a particular operation, but multiple sets of numbers to perform those operations on. |
|||
Test 1 was used to determine the duration required for the PIC32 to throw a pin high and pull a pin low, while Tests 2 through 43 were used to measure the actual performance of each operation and data-type pair. |
|||
==Circuit== |
|||
The circuit used for testing consisted of connecting the oscilloscope to the output pin (in this case, pin A2) in order to view the waveform. |
|||
[[Image:Kp_mw_tp_benchmarkingcircuit2010.jpg]] |
|||
== Results == |
|||
Below is the quick summary of the testing results comparing each data type and each operation. All results are normalized to 60 ns (1.00 = 60 ns). |
|||
{| class="wikitable" style="text-align:right;" border="0" CELLSPACING = "10" FRAME = "LEFT" |
|||
|+Operation vs. Data type |
|||
|- |
|||
! !! Subtraction !! Addition !! Multiplication !! Division !! Square Root !! Sine !! Cosine !! |
|||
|- |
|||
! Char |
|||
| 1.87 || 1.65 || 2.28 || N/A || 34.78 || 116.03 || 98.93 |
|||
|- |
|||
! Short |
|||
| 1.03 || 1.67 || 1.47 || 5.00 || 144.57 || 230.40 || 215.40 |
|||
|- |
|||
! Int |
|||
| 1.07 || 1.00 || 1.43 || 8.10 || 145.62 || 316.43 || 330.62 |
|||
|- |
|||
! Long Long |
|||
| 2.50 || 3.33 || 6.63 || 28.73 || 271.85 || 498.30 || 510.38 |
|||
|- |
|||
! Float |
|||
| 15.00 || 17.07 || 12.27 || 27.90 || 143.93 || 326.03 || 338.28 |
|||
|- |
|||
! Double |
|||
| 26.00 || 20.60 || 23.97 || 53.07 || 133.30 || 343.73 || 333.52 |
|||
|} |
|||
Here is the Excel file with the full results, in nanoseconds and normalized. [[Media:Lab5.xls]] |
|||
Below are the results of each particular test we performed, coupled with a short explanation for each result. |
|||
=== Basic Timing Constants (Test 1) === |
|||
{{multiple image |
|||
| width = 100 |
|||
| footer = Actual waveforms as seen on the output pin. |
|||
| image1 = Test01-high-duration.png |
|||
| alt1 = Test (a): Time required to throw an output pin high |
|||
| caption1 = Test (a): Time required to throw an output pin high |
|||
| image2 = Test01-low-duration.png |
|||
| alt2 = Test (b): Time required to pull an output pin low |
|||
| caption2 = Test (b): Time required to pull an output pin low |
|||
| image3 = Test01-full-while-loop.png |
|||
| alt3 = Test (c): Time required to execute 1 empty while loop cycle |
|||
| caption3 = Test (c): Time required to execute 1 empty while loop cycle |
|||
}} |
|||
This test determines the length of time required by the PIC32 chip to push a given output pin high and pull the same pin low. |
|||
* Test (a): Time required to throw an output pin high |
|||
** Instruction: <code>PIN_A2 = 1;</code> |
|||
** Time: 63 ns |
|||
* Test (b): Time required to pull an output pin low |
|||
** Instruction: <code>PIN_A2 = 0;</code> |
|||
** Time: 63 ns |
|||
* Test (c): Time required to execute 1 empty while loop cycle |
|||
** Instruction: <code>while(1){}</code> |
|||
** Time: 23 ns |
|||
=== char Performance === |
|||
---- |
|||
A <code>char</code> data type, in ANSI C, is a value holding one byte, or one character code. The actual number of bits in a char in a particular implementation is documented as CHAR_BIT in that implementation's <code>limits.h</code> file. In practice, it is almost always 8 bits, corresponding to a decimal range of 0 to 255 inclusive. Given that there are many different ways to perform a given operation on a char, we've done our best to include several different methods that we feel are representative of normal coding practices. Depending on how the source code is compiled, these different methods may or may not produce different results. Furthermore, unless otherwise noted, all (a) benchmarks are operations on two predefined (and mot likely pre-computed) ASCII letters, all (b) benchmarks are operations on two predefined (and most likely pre-computed) numbers in the range of 0 to 255, and all (c) benchmarks are operations on two random (and most likely '''not''' pre-computed) numbers. These multiple benchmarks per test exist to illustrate the differences in execution time between pre-compiled operations that the compiler may have optimized ahead of time and operations the PIC must perform in real time. |
|||
==== Subtraction (Test 2) ==== |
|||
{{multiple image |
|||
| width = 100 |
|||
| image1 = Test02-parta.png |
|||
| alt1 = Test (a): Time required to subtract two constant chars |
|||
| caption1 = Test (a): Time required to subtract two constant chars |
|||
| image2 = Test02-partb.png |
|||
| alt2 = Test (b): Time required to subtract two ints cast into a char |
|||
| caption2 = Test (b): Time required to subtract two ints cast into a char |
|||
| image3 = Test02-partc.png |
|||
| alt3 = Test (c): Time required to subtract two random chars |
|||
| caption3 = Test (c): Time required to subtract two random chars |
|||
}} |
|||
This test determines the length of time required by the PIC32 chip to subtract one 8-bit number (a char) from another 8-bit number (a char). |
|||
* Test (a): Time required to subtract two constant chars (may be pre-computed) |
|||
** Instruction: <code>letter_capital_a = 'z'-'7';</code> |
|||
** Time: 50 ns |
|||
* Test (b): Time required to subtract two ints cast into a char (may be pre-computed) |
|||
** Instruction: <code>letter_b = 100-2;</code> |
|||
** Time: 50 ns |
|||
* Test (c): Time required to subtract two random chars (guaranteed not to be pre-comp) |
|||
** Instruction: <code>random_char = larger-smaller;</code>* |
|||
** Time: 112 ns |
|||
<nowiki>*</nowiki>See Code section for a more details on how randomness was guaranteed. |
|||
==== Addition (Test 3) ==== |
|||
{{multiple image |
|||
| width = 100 |
|||
| image1 = Test03-parta.png |
|||
| alt1 = Test (a): Time required to add two constant chars |
|||
| caption1 = Test (a): Time required to add two constant chars |
|||
| image2 = Test03-partb.png |
|||
| alt2 = Test (b): Time required to add two ints cast into a char |
|||
| caption2 = Test (b): Time required to add two ints cast into a char |
|||
| image3 = Test03-partc.png |
|||
| alt3 = Test (c): Time required to add two random chars |
|||
| caption3 = Test (c): Time required to add two random chars |
|||
}} |
|||
This test determines the length of time required by the PIC32 chip to add one 8-bit number (a char) to another 8-bit number (a char). |
|||
* Test (a): Time required to add two constant chars (may be pre-computed) |
|||
** Instruction: <code>letter_a = ')'+'8';</code> |
|||
** Time: 50 ns |
|||
* Test (b): Time required to add two ints cast into a char (may be pre-computed) |
|||
** Instruction: <code>letter_b = 97+1;</code> |
|||
** Time: 50 ns |
|||
* Test (c): Time required to add two random chars (guaranteed not to be pre-comp) |
|||
** Instruction: <code>random_char = random_char1+random_char2;</code>* |
|||
** Time: 99 ns |
|||
<nowiki>*</nowiki>See Code section for a more details on how randomness was guaranteed. |
|||
==== Multiplication (Test 4) ==== |
|||
{{multiple image |
|||
| width = 100 |
|||
| image1 = Test04-parta.jpg |
|||
| alt1 = Test (a): Time required to multiply two constant chars |
|||
| caption1 = Test (a): Time required to multiply two constant chars |
|||
| image2 = Test04-partb.jpg |
|||
| alt2 = Test (b): Time required to multiply two ints cast into a char |
|||
| caption2 = Test (b): Time required to multiply two ints cast into a char |
|||
| image3 = Test04-partc.jpg |
|||
| alt3 = Test (c): Time required to multiply of two random chars |
|||
| caption3 = Test (c): Time required to multiply of two random chars |
|||
}} |
|||
This test determines the length of time required by the PIC32 chip to multiply one 8-bit number (a char) by another 8-bit number (a char). |
|||
* Test (a): Time required to multiply two constant chars (may be pre-computed) |
|||
** Instruction: <code>ascii_225 = 'K'*'♥';</code> |
|||
** Time: 49 ns |
|||
* Test (b): Time required to multiply two ints cast into a char (may be pre-computed) |
|||
** Instruction: <code>ascii_200 = 100*2;</code> |
|||
** Time: 48 ns |
|||
* Test (c): Time required to multiply of two random chars (guaranteed not to be pre-comp) |
|||
** Instruction: <code>random_char = larger*smaller;</code>* |
|||
** Time: 137 ns |
|||
<nowiki>*</nowiki>See Code section for a more details on how randomness was guaranteed. |
|||
==== Division (Test 5) ==== |
|||
{{multiple image |
|||
| width = 100 |
|||
| image1 = Test05-parta.jpg |
|||
| alt1 = Test (a): Time required to divide two constant chars |
|||
| caption1 = Test (a): Time required to divide two constant chars |
|||
| image2 = Test05-parta.jpg |
|||
| alt2 = Test (b): Time required to divide two ints cast into a char |
|||
| caption2 = Test (b): Time required to divide two ints cast into a char |
|||
}} |
|||
This test determines the length of time required by the PIC32 chip to divide one 8-bit number (a char) by another 8-bit number (a char). |
|||
* Test (a): Time required to divide two constant chars (may be pre-computed) |
|||
** Instruction: <code>ascii_25 = 'K'/'♥'; //thp</code> |
|||
** Time: 48 ns |
|||
* Test (b): Time required to divide two ints cast into a char (may be pre-computed) |
|||
** Instruction: <code>letter_2 = 100/2;</code> |
|||
** Time: 50 ns |
|||
* Test (c): Time required to divide two random chars (guaranteed not to be pre-comp) |
|||
** Instruction: <code>random_char = larger/smaller;</code>* |
|||
** Time: N/A |
|||
<nowiki>*</nowiki> We had great difficulty in trying to test this particular operation. After some research with an oscilloscope and voltmeter, it seems that the PIC32 is not capable of dividing chars in this way. More specifically, every time the PIC32 attempts to divide one char by another, all output pins are immediately grounded. We've tested this code in other C environments, and it works as expected, so the error must either lie somewhere within our specific PICs (which would be very unlikely - we tested 3) or the silicon architecture of the PIC32 itself (sill unlikely, but given the number of PICs we tested, more probable). If you absolutely need to divide chars, cast them to ints first, perform your division, then cast them back to chars. |
|||
==== Square Root (Test 6) ==== |
|||
{{multiple image |
|||
| width = 100 |
|||
| image1 = Test06-parta.jpg |
|||
| alt1 = Test (a): Time required to sqrt() a constant char |
|||
| caption1 = Test (a): Time required to sqrt() a constant char |
|||
| image2 = Test06-partb.jpg |
|||
| alt2 = Test (b): Time required to sqrt() an int cast into a char |
|||
| caption2 = Test (b): Time required to sqrt() an int cast into a char |
|||
| image3 = Test06-partc.jpg |
|||
| alt3 = Test (c): Time required to sqrt() a random char |
|||
| caption3 = Test (c): Time required to sqrt() a random char |
|||
}} |
|||
{{multiple image |
|||
| width = 100 |
|||
| image1 = Test06-partd.jpg |
|||
| alt1 = Test (d): Time required to ^(1/2) a constant char |
|||
| caption1 = Test (d): Time required to ^(1/2) a constant char |
|||
| image2 = Test06-parte.jpg |
|||
| alt2 = Test (e): Time required to ^(1/2) an int cast into a char |
|||
| caption2 = Test (e): Time required to ^(1/2) an int cast into a char |
|||
| image3 = Test06-partf.jpg |
|||
| alt3 = Test (f): Time required to ^(1/2) a random char |
|||
| caption3 = Test (f): Time required to ^(1/2) a random char |
|||
}} |
|||
This test determines the length of time required by the PIC32 chip to square root one 8-bit number (a char). Benchmarks (a) through (c) use the <code>sqrt()</code> function while benchmarks (d) through (f) raise the operands to the 1/2 power. |
|||
* Test (a): Time required to sqrt() a constant char (may be pre-computed) |
|||
** Instruction: <code>ascii_25 = sqrt('u');</code> |
|||
** Time: 48 ns |
|||
* Test (b): Time required to sqrt() an int cast into a char (may be pre-computed) |
|||
** Instruction: <code>number_10 = sqrt(100);</code> |
|||
** Time: 48 ns |
|||
* Test (c): Time required to sqrt() a random char (guaranteed not to be pre-comp) |
|||
** Instruction: <code>random_char = sqrt(random_char1);</code> |
|||
** Time: 2087 ns |
|||
* Test (d): Time required to ^(1/2) a constant char (may be pre-computed) |
|||
** Instruction: <code>ascii_25 = ('u')^(1/2);</code> |
|||
** Time: 48 ns |
|||
* Test (e): Time required to ^(1/2) an int cast into a char (may be pre-computed) |
|||
** Instruction: <code>number_10 = (100)^(1/2);</code> |
|||
** Time: 48 ns |
|||
* Test (f): Time required to ^(1/2) a random char (guaranteed not to be pre-comp) |
|||
** Instruction: <code>random_char = (random_char2)^(1/2);</code>* |
|||
** Time: 75 ns |
|||
<nowiki>*</nowiki>See Code section for a more details on how randomness was guaranteed. |
|||
==== Sine (Test 7) ==== |
|||
{{multiple image |
|||
| width = 100 |
|||
| image1 = Test07-parta.jpg |
|||
| alt1 = Test (a): Time required to take the sine of a constant char |
|||
| caption1 = Test (a): Time required to take the sine of a constant char |
|||
| image2 = Test07-partb.jpg |
|||
| alt2 = Test (b): Time required to take the sine of a constant int cast into a char |
|||
| caption2 = Test (b): Time required to take the sine of a constant int cast into a char |
|||
| image3 = Test07-partc.jpg |
|||
| alt3 = Test (c): Time required to take the sine of a random char |
|||
| caption3 = Test (c): Time required to take the sine of a random char |
|||
}} |
|||
This test determines the length of time required by the PIC32 chip to take the sine of an 8-bit number (a char). |
|||
* Test (a): Time required to take the sine of a constant char (may be pre-computed) |
|||
** Instruction: <code>ascii_25 = sin('K');</code> |
|||
** Time: 9963 ns |
|||
* Test (b): Time required to take the sine of a constant int cast into a char (may be pre-computed) |
|||
** Instruction: <code>letter_2 = sin(50);</code> |
|||
** Time: 9550 ns |
|||
* Test (c): Time required to take the sine of a random char (guaranteed not to be pre-comp) |
|||
** Instruction: <code>random_char = sin(larger);</code>* |
|||
** Time: 6962 ns |
|||
<nowiki>*</nowiki>See Code section for a more details on how randomness was guaranteed. |
|||
==== Cosine (Test 8) ==== |
|||
{{multiple image |
|||
| width = 100 |
|||
| image1 = Test08-parta.jpg |
|||
| alt1 = Test (a): Time required to take the cosine of a constant char |
|||
| caption1 = Test (a): Time required to take the cosine of a constant char |
|||
| image2 = Test08-partb.jpg |
|||
| alt2 = Test (b): Time required to take the cosine of a constant int cast into a char |
|||
| caption2 = Test (b): Time required to take the cosine of a constant int cast into a char |
|||
| image3 = Test08-partc.jpg |
|||
| alt3 = Test (c): Time required to take the cosine of a random char |
|||
| caption3 = Test (c): Time required to take the cosine of a random char |
|||
}} |
|||
This test determines the length of time required by the PIC32 chip to take the cosine of an 8-bit number (a char). |
|||
* Test (a): Time required to take the cosine of a constant char (may be pre-computed) |
|||
** Instruction: <code>ascii_25 = cos('K');</code> |
|||
** Time: 9111 ns |
|||
* Test (b): Time required to take the cosine of a constant int cast into a char (may be pre-computed) |
|||
** Instruction: <code>letter_2 = cos(50);</code> |
|||
** Time: 8724 ns |
|||
* Test (c): Time required to take the cosine of a random char (guaranteed not to be pre-comp) |
|||
** Instruction: <code>random_char = cos(larger);</code>* |
|||
** Time: 5936 ns |
|||
<nowiki>*</nowiki>See Code section for a more details on how randomness was guaranteed. |
|||
=== short Performance === |
|||
---- |
|||
A short data type, in ANSI C, is a value that holds 2 bytes, or 16 bits. This corresponds to a range of 0 to 65535 (2^16 - 1). If the variable is signed (negative), then the range is from -32767 to 32767 (-2^15 + 1 to 2^15 -1). Given that there are many different ways to perform a given operation on a short, we've done our best to include several different methods that we feel are representative of normal coding practices. Depending on how the source code is compiled, these different methods may or may not produce different results. Furthermore, unless otherwise noted, all (a) benchmarks are operations on two predefined (and mot likely pre-computed) shorts and all (b) benchmarks are operations on two random (and most likely '''not''' pre-computed) shorts. These multiple benchmarks per test exist to illustrate the differences in execution time between pre-compiled operations that the compiler may have optimized ahead of time and operations the PIC must perform in real time. |
|||
==== Subtraction (Test 9) ==== |
|||
{{multiple image |
|||
| width = 100 |
|||
| image1 = test09-parta.png |
|||
| alt1 = Test (a): Time required to subtract two constant shorts |
|||
| caption1 = Test (a): Time required to subtract two constant shorts |
|||
| image2 = test09-partb.png |
|||
| alt2 = Test (b): Time required to subtract two random shorts |
|||
| caption2 = Test (b): Time required to subtract two random shorts |
|||
}} |
|||
This test determines the length of time required by the PIC32 chip to subtract one 16-bit number (a short) from another 16-bit number (a short). |
|||
*Test (a): Time required to subtract two constant shorts (may be pre-computed) |
|||
** Instruction: <code>short1 = 1337-343;</code> |
|||
** Time: 25 ns |
|||
*Test (b): Time required to subtract two random shorts (guaranteed not to be pre-comp) |
|||
** Instruction: <code>random_short = larger-smaller;</code> |
|||
** Time: 62 ns |
|||
==== Addition (Test 10) ==== |
|||
{{multiple image |
|||
| width = 100 |
|||
| image1 = Test10-parta.png |
|||
| alt1 = Test (a): Time required to add two constant shorts |
|||
| caption1 = Test (a): Time required to add two constant shorts |
|||
| image2 = Test10-partb.png |
|||
| alt2 = Test (b): Time required to add two random shorts |
|||
| caption2 = Test (b): Time required to add two random shorts |
|||
}} |
|||
This test determines the length of time required by the PIC32 chip to add one 16-bit number (a short) to another 16-bit number (a short). |
|||
*Test (a): Time required to add two constant shorts (may be pre-computed) |
|||
**Instruction: <code>short1 = 1337+343;</code> |
|||
**Time: 50 ns |
|||
*Test (b): Time required to add two random shorts (guaranteed not to be pre-comp) |
|||
**Instruction: <code>random_short = larger-smaller;</code> |
|||
**Time: 100 ns |
|||
==== Multiplication (Test 11) ==== |
|||
{{multiple image |
|||
| width = 100 |
|||
| image1 = Test11-parta.png |
|||
| alt1 = Test (a): Time required to multiply two constant shorts |
|||
| caption1 = Test (a): Time required to multiply two constant shorts |
|||
| image2 = Test11-partb.png |
|||
| alt2 = Test (b): Time required to multiply two random shorts |
|||
| caption2 = Test (b): Time required to multiply two random shorts |
|||
}} |
|||
This test determines the length of time required by the PIC32 chip to multiply one 16-bit number (a short) by another 16-bit number (a short). |
|||
*Test (a): Time required to multiply two constant shorts (may be pre-computed) |
|||
**Instruction: <code>short1 = 47*347;</code> |
|||
**Time: 24 ns |
|||
*Test (b): Time required to multiply two random shorts (guaranteed not to be pre-comp) |
|||
**Instruction: <code>random_short = larger*smaller;</code> |
|||
**Time: 88 ns |
|||
==== Division (Test 12) ==== |
|||
{{multiple image |
|||
| width = 100 |
|||
| image1 = Test12-parta.png |
|||
| alt1 = Test (a): Time required to divide two constant shorts |
|||
| caption1 = Test (a): Time required to divide two constant shorts |
|||
| image2 = Test12-partb.png |
|||
| alt2 = Test (b): Time required to divide two random shorts |
|||
| caption2 = Test (b): Time required to divide two random shorts |
|||
}} |
|||
This test determines the length of time required by the PIC32 chip to divide one 16-bit number (a short) by another 16-bit number (a short). |
|||
*Test (a): Time required to divide two constant shorts (may be pre-computed) |
|||
**Instruction: <code>short1 = 62488/347;</code> |
|||
**Time: 28 ns |
|||
*Test (b): Time required to divide two random shorts (guaranteed not to be pre-comp) |
|||
**Instruction: <code>random_short = larger/smaller;</code> |
|||
**Time: 300 ns |
|||
==== Square Root (Test 13) ==== |
|||
{{multiple image |
|||
| width = 100 |
|||
| image1 = Test13-parta.png |
|||
| alt1 = Test (a): Time required to sqrt() a constant short |
|||
| caption1 = Test (a): Time required to sqrt() a constant short |
|||
| image2 = Test13-partb.png |
|||
| alt2 = Test (b): Time required to sqrt() a random short |
|||
| caption2 = Test (b): Time required to sqrt() a random short |
|||
| image3 = Test13-partc.png |
|||
| alt3 = Test (c): Time required to ^(1/2) a constant short |
|||
| caption3 = Test (c): Time required to ^(1/2) a constant short |
|||
| image4 = Test13-partd.png |
|||
| alt4 = Test (d): Time required to ^(1/2) a random short |
|||
| caption4 = Test (d): Time required to ^(1/2) a random short |
|||
}} |
|||
This test determines the length of time required by the PIC32 chip to get the square root of one 16-bit number (a short). Tests (a) and (b) use the 'sqrt()' method, while tests (c) and (d) use a number to the 1/2 power. |
|||
*Test (a): Time required to sqrt() a constant short (may be pre-computed) |
|||
**Instruction: <code>short1 = sqrt(31337);</code> |
|||
**Time: 50 ns |
|||
*Test (b): Time required to sqrt() a random short (guaranteed not to be pre-comp)* |
|||
**Instruction: <code>short2 = sqrt(random_short1);</code> |
|||
**Time: 8674 ns |
|||
*Test (c): Time required to ^(1/2) a constant short (may be pre-computed) |
|||
**Instruction: <code>short3 = (30343)^(1/2);</code> |
|||
**Time: 50 ns |
|||
*Test (d): Time required to ^(1/2) a random short (guaranteed not to be pre-comp)* |
|||
**Instruction: <code>short4 = (random_short2)^(1/2);</code> |
|||
**Time: 76 ns |
|||
<nowiki>*</nowiki>See Code section for a more details on how randomness was guaranteed. |
|||
==== Sine (Test 14) ==== |
|||
{{multiple image |
|||
| width = 100 |
|||
| image1 = Test14-parta.png |
|||
| alt1 = Test (a): Time required to take the sine of a constant short |
|||
| caption1 = Test (a): Time required to take the sine of a constant short |
|||
| image2 = Test14-partb.png |
|||
| alt2 = Test (b): Time required to take the sine of a random short |
|||
| caption2 = Test (b): Time required to take the sine of a random short |
|||
}} |
|||
This test determines the length of time required by the PIC32 chip to get the sine of one 16-bit number (a short). |
|||
*Test (a): Time required to take the sine of a constant short (may be pre-computed) |
|||
**Instruction: <code>short1 = sin(31337);</code> |
|||
**Time: 13014 ns |
|||
*Test (b): Time required to take the sine of a random short (guaranteed not to be pre-comp) |
|||
**Instruction: <code>random_short = sin(random_short1);</code> |
|||
**Time: 13824 ns |
|||
==== Cosine (Test 15) ==== |
|||
{{multiple image |
|||
| width = 100 |
|||
| image1 = Test15-parta.png |
|||
| alt1 = Test (a): Time required to take the cosine of a constant short |
|||
| caption1 = Test (a): Time required to take the cosine of a constant short |
|||
| image2 = Test15-partb.png |
|||
| alt2 = Test (b): Time required to take the cosine of a random short |
|||
| caption2 = Test (b): Time required to take the cosine of a random short |
|||
}} |
|||
This test determines the length of time required by the PIC32 chip to get the cosine of one 16-bit number (a short). |
|||
*Test (a): Time required to take the cosine of a constant short (may be pre-computed) |
|||
**Instruction: <code>short1 = cos(31337);</code> |
|||
**Time: 12174 ns |
|||
*Test (b): Time required to take the cosine of a random short (guaranteed not to be pre-comp) |
|||
**Instruction: <code>random_short = cos(random_short1);</code> |
|||
**Time: 12924 ns |
|||
===int Performance=== |
|||
---- |
|||
An int data type, in ANSI C, is a value that holds 4 bytes, or 32 bits. This corresponds to a range of 0 to 4294967295 (2^32 - 1). If the variable is signed (negative), then the range is from -2147483647 to 2147483647 (-2^31 + 1 to 2^31 -1). Given that there are many different ways to perform a given operation on an int, we've done our best to include several different methods that we feel are representative of normal coding practices. Depending on how the source code is compiled, these different methods may or may not produce different results. Furthermore, unless otherwise noted, all (a) benchmarks are operations on two predefined (and mot likely pre-computed) ints and all (b) benchmarks are operations on two random (and most likely '''not''' pre-computed) ints. These multiple benchmarks per test exist to illustrate the differences in execution time between pre-compiled operations that the compiler may have optimized ahead of time and operations the PIC must perform in real time. |
|||
==== Subtraction (Test 16) ==== |
|||
{{multiple image |
|||
| width = 100 |
|||
| image1 = Test16-parta.png |
|||
| alt1 = Time required to subtract two constant ints |
|||
| caption1 = Time required to subtract two constant ints |
|||
| image2 = Test16-partb.png |
|||
| alt2 = Test (b): Time required to subtract two random ints |
|||
| caption2 = Test (b): Time required to subtract two random ints |
|||
}} |
|||
This test determines the length of time required by the PIC32 chip to subtract one 32-bit number (an int) from another 32-bit number (an int). |
|||
*Test (a): Time required to subtract two constant ints (may be pre-computed) |
|||
**Instruction: <code>int1 = 2271988-7889;</code> |
|||
**Time: 38 ns |
|||
*Test (b): Time required to subtract two random ints (guaranteed not to be pre-comp) |
|||
**Instruction: <code>random_int = larger-smaller;</code> |
|||
**Time: 64 ns |
|||
==== Addition (Test 17) ==== |
|||
{{multiple image |
|||
| width = 100 |
|||
| footer = Actual waveforms as seen on the output pin. |
|||
| image1 = Test17-parta.png |
|||
| alt1 = Test (a): Time required to add two constant ints |
|||
| caption1 = Test (a): Time required to add two constant ints |
|||
| image2 = Test17-partb.png |
|||
| alt2 = Test (b): Time required to add two random ints |
|||
| caption2 = Test (b): Time required to add two random ints |
|||
}} |
|||
This test determines the length of time required by the PIC32 chip to add one 32-bit number (an int) to another 32-bit number (an int). |
|||
*Test (a): Time required to add two constant ints (may be pre-computed) |
|||
**Instruction: |
|||
**Time: 26 ns |
|||
*Test (b): Time required to add two random ints (guaranteed not to be pre-comp) |
|||
**Instructions: |
|||
**Time: 60 ns |
|||
==== Multiplication (Test 18) ==== |
|||
{{multiple image |
|||
| width = 100 |
|||
| image1 = Test18-parta.png |
|||
| alt1 = Test (a): Time required to multiply two constant ints |
|||
| caption1 = Test (a): Time required to multiply two constant ints |
|||
| image2 = Test18-partb.png |
|||
| alt2 = Test (b): Time required to multiply two random ints |
|||
| caption2 = Test (b): Time required to multiply two random ints |
|||
}} |
|||
This test determines the length of time required by the PIC32 chip to multiply one 32-bit number (an int) by another 32-bit number (an int). |
|||
*Test (a): Time required to multiply two constant ints (may be pre-computed) |
|||
**Instruction: <code>int1 = 65500*6550;</code> |
|||
**Time: 38 ns |
|||
*Test (b): Time required to multiply two random ints (guaranteed not to be pre-comp) |
|||
**Instruction: <code>random_int = random_int1*random_int2;</code> |
|||
**Time: 86 ns |
|||
==== Division (Test 19) ==== |
|||
{{multiple image |
|||
| width = 100 |
|||
| image1 = Test19-parta.png |
|||
| alt1 = Test (a): Time required to divide two constant ints |
|||
| caption1 = Test (a): Time required to divide two constant ints |
|||
| image2 = Test19-partb.png |
|||
| alt2 = Test (b): Time required to divide two random ints |
|||
| caption2 = Test (b): Time required to divide two random ints |
|||
}} |
|||
This test determines the length of time required by the PIC32 chip to divide one 32-bit number (an int) by another 32-bit number (an int). |
|||
*Test (a): Time required to divide two constant ints (may be pre-computed) |
|||
**Instruction: <code>int1 = 1943438364/347;</code> |
|||
**Time: 38 ns |
|||
*Test (b): Time required to divide two random ints (guaranteed not to be pre-comp) |
|||
**Instruction: <code>random_int = larger/smaller;</code> |
|||
**Time: 486 ns |
|||
==== Square Root (Test 20) ==== |
|||
{{multiple image |
|||
| width = 100 |
|||
| image1 = Test20-parta.png |
|||
| alt1 = Test (a): Time required to sqrt() a constant int |
|||
| caption1 = Test (a): Time required to sqrt() a constant int |
|||
| image2 = Test20-partb.png |
|||
| alt2 = Test (b): Time required to sqrt() a random int |
|||
| caption2 = Test (b): Time required to sqrt() a random int |
|||
| image3 = Test20-partc.png |
|||
| alt3 = Test (c): Time required to ^(1/2) a constant int |
|||
| caption3 = Test (c): Time required to ^(1/2) a constant int |
|||
| image4 = Test20-partd.png |
|||
| alt4 = Test (d): Time required to ^(1/2) a random int |
|||
| caption4 = Test (d): Time required to ^(1/2) a random int |
|||
}} |
|||
This test determines the length of time required by the PIC32 chip to get the square root of one 32-bit number (an int). Tests (a) and (b) use the 'sqrt()' method, while tests (c) and (d) use a number to the 1/2 power. |
|||
*Test (a): Time required to sqrt() a constant int (may be pre-computed) |
|||
**Instruction: <code>int1 = sqrt(347343777);</code> |
|||
**Time: 50 ns |
|||
*Test (b): Time required to sqrt() a random int (guaranteed not to be pre-comp) |
|||
**Instruction: <code>int2 = sqrt(random_int1);</code> |
|||
**Time: 8737 ns |
|||
*Test (c): Time required to ^(1/2) a constant int (may be pre-computed) |
|||
**Instruction: <code>int3 = (743347343)^(1/2);</code> |
|||
**Time: 88 ns |
|||
*Test (d): Time required to ^(1/2) a random int (guaranteed not to be pre-comp) |
|||
**Instruction: <code>int4 = (random_int2)^(1/2);</code> |
|||
**Time: 74 ns |
|||
==== Sine (Test 21) ==== |
|||
{{multiple image |
|||
| width = 100 |
|||
| image1 = Test21-parta.png |
|||
| alt1 = Test (a): Time required to take the sine of a constant int |
|||
| caption1 = Test (a): Time required to take the sine of a constant int |
|||
| image2 = Test21-partb.png |
|||
| alt2 = Test (b): Time required to take the sine of a random int |
|||
| caption2 = Test (b): Time required to take the sine of a random int |
|||
}} |
|||
This test determines the length of time required by the PIC32 chip to get the sine of one 32-bit number (an int). |
|||
*Test (a): Time required to take the sine of a constant int (may be pre-computed) |
|||
**Instruction: <code>int1 = sin(1347433747);</code> |
|||
**Time: 19488 ns |
|||
*Test (b): Time required to take the sine of a random int (guaranteed not to be pre-comp) |
|||
**Instruction: <code>random_int = sin(random_int1);</code> |
|||
**Time: 18988 ns |
|||
==== Cosine (Test 22) ==== |
|||
{{multiple image |
|||
| width = 100 |
|||
| image1 = Test22-parta.png |
|||
| alt1 = Test (a): Time required to take the cosine of a constant int |
|||
| caption1 = Test (a): Time required to take the cosine of a constant int |
|||
| image2 = Test22-partb.png |
|||
| alt2 = Test (b): Time required to take the cosine of a random int |
|||
| caption2 = Test (b): Time required to take the cosine of a random int |
|||
}} |
|||
This test determines the length of time required by the PIC32 chip to get the cosine of one 32-bit number (an int). |
|||
*Test (a): Time required to take the cosine of a constant int (may be pre-computed) |
|||
**Instruction: <code>int1 = cos(1347433747);</code> |
|||
**Time: 20324 ns |
|||
*Test (b): Time required to take the cosine of a random int (guaranteed not to be pre-comp) |
|||
**Instruction: <code>random_int = cos(random_int1);</code> |
|||
**Time: 19837 ns |
|||
===long long Performance=== |
|||
---- |
|||
A long long data type, in ANSI C, is a value that holds 8 bytes, or 64 bits. This corresponds to a range of 0 to 1.84467441 × 10^19 (2^64 - 1). If the variable is signed (negative), then the range is from -9.22337204 × 10^18 to 9.22337204 × 10^18 (-2^63 + 1 to 2^63 -1). Given that there are many different ways to perform a given operation on a long long, we've done our best to include several different methods that we feel are representative of normal coding practices. Depending on how the source code is compiled, these different methods may or may not produce different results. Furthermore, unless otherwise noted, all (a) benchmarks are operations on two predefined (and mot likely pre-computed) long longs and all (b) benchmarks are operations on two random (and most likely '''not''' pre-computed) long longs. These multiple benchmarks per test exist to illustrate the differences in execution time between pre-compiled operations that the compiler may have optimized ahead of time and operations the PIC must perform in real time. |
|||
==== Subtraction (Test 23) ==== |
|||
{{multiple image |
|||
| width = 100 |
|||
| image1 = Test23-parta.png |
|||
| alt1 = Test (a): Time required to subtract two constant long longs |
|||
| caption1 = Test (a): Time required to subtract two constant long longs |
|||
| image2 = Test23-partb.png |
|||
| alt2 = Test (b): Time required to subtract two random long longs |
|||
| caption2 = Test (b): Time required to subtract two random long longs |
|||
}} |
|||
This test determines the length of time required by the PIC32 chip to subtract one 64-bit number (a long long) from another 64-bit number (a long long). |
|||
*Test (a): Time required to subtract two constant long longs (may be pre-computed) |
|||
**Instruction: <code>longlong1 = 17179800000LL-7179869184LL;</code> |
|||
**Time: 186 ns |
|||
*Test (b): Time required to subtract two random long longs (guaranteed not to be pre-comp) |
|||
**Instruction: <code>random_longlong = larger-smaller;</code> |
|||
**Time: 150 ns |
|||
==== Addition (Test 24) ==== |
|||
{{multiple image |
|||
| width = 100 |
|||
| image1 = Test24-parta.png |
|||
| alt1 = Test (a): Time required to add two constant long longs |
|||
| caption1 = Test (a): Time required to add two constant long longs |
|||
| image2 = Test24-partb.png |
|||
| alt2 = Test (b): Time required to add two random long longs |
|||
| caption2 = Test (b): Time required to add two random long longs |
|||
}} |
|||
This test determines the length of time required by the PIC32 chip to add one 64-bit number (a long long) to another 64-bit number (a long long). |
|||
*Test (a): Time required to add two constant long longs (may be pre-computed) |
|||
**Instruction: <code>longlong1 = 17179800000LL+179869184LL;</code> |
|||
**Time: 88 ns |
|||
*Test (b): Time required to add two random long longs (guaranteed not to be pre-comp) |
|||
**Instruction: <code>random_longlong = larger+smaller;</code> |
|||
**Time: 200 ns |
|||
==== Multiplication (Test 25) ==== |
|||
{{multiple image |
|||
| width = 100 |
|||
| image1 = Test25-parta.png |
|||
| alt1 = Test (a): Time required to multiply two constant long longs |
|||
| caption1 = Test (a): Time required to multiply two constant long longs |
|||
| image2 = Test25-partb.png |
|||
| alt2 = Test (b): Time required to multiply two random long longs |
|||
| caption2 = Test (b): Time required to multiply two random long longs |
|||
}} |
|||
This test determines the length of time required by the PIC32 chip to multiply one 64-bit number (a long long) by another 64-bit number (a long long). |
|||
*Test (a): Time required to multiply two constant long longs (may be pre-computed) |
|||
**Instruction: <code>longlong1 = 171798000LL*7186984LL;</code> |
|||
**Time: 74 ns |
|||
*Test (b): Time required to multiply two random long longs (guaranteed not to be pre-comp) |
|||
**Instruction: <code>random_longlong = larger*smaller;</code> |
|||
**Time: 398 ns |
|||
==== Division (Test 26) ==== |
|||
{{multiple image |
|||
| width = 100 |
|||
| image1 = Test26-parta.png |
|||
| alt1 = Test (a): Time required to divide two constant long longs |
|||
| caption1 = Test (a): Time required to divide two constant long longs |
|||
| image2 = Test26-partb.png |
|||
| alt2 = Test (b): Time required to divide two random long longs |
|||
| caption2 = Test (b): Time required to divide two random long longs |
|||
}} |
|||
This test determines the length of time required by the PIC32 chip to divide one 64-bit number (a long long) by another 64-bit number (a long long). |
|||
*Test (a): Time required to divide two constant long longs (may be pre-computed) |
|||
**Instruction: <code>longlong1 = 17179800000LL/7179869184LL;</code> |
|||
**Time: 74 ns |
|||
*Test (b): Time required to divide two random long longs (guaranteed not to be pre-comp) |
|||
**Instruction: <code>random_longlong = larger/smaller;</code> |
|||
**Time: 1724 ns |
|||
==== Square Root (Test 27) ==== |
|||
{{multiple image |
|||
| width = 100 |
|||
| image1 = Test27-parta.png |
|||
| alt1 = Test (a): Time required to sqrt() a constant long long |
|||
| caption1 = Test (a): Time required to sqrt() a constant long long |
|||
| image2 = Test27-partb.png |
|||
| alt2 = Test (b): Time required to sqrt() a random long long |
|||
| caption2 = Test (b): Time required to sqrt() a random long long |
|||
| image3 = Test27-partc.png |
|||
| alt3 = Test (c): Time required to ^(1/2) a constant long long |
|||
| caption3 = Test (c): Time required to ^(1/2) a constant long long |
|||
| image4 = Test27-partd.png |
|||
| alt4 = Test (d): Time required to ^(1/2) a random long long |
|||
| caption4 = Test (d): Time required to ^(1/2) a random long long |
|||
}} |
|||
This test determines the length of time required by the PIC32 chip to get the square root of one 64-bit number (a long long). Tests (a) and (b) use the 'sqrt()' method, while tests (c) and (d) use a number to the 1/2 power. |
|||
*Test (a): Time required to sqrt() a constant long long (may be pre-computed) |
|||
**Instruction: <code>longlong1 = sqrt(17179800000LL);</code> |
|||
**Time: 87 ns |
|||
*Test (b): Time required to sqrt() a random long long (guaranteed not to be pre-comp) |
|||
**Instruction: <code>random_longlong = sqrt(random_longlong1);</code> |
|||
**Time: 16311 ns |
|||
*Test (c): Time required to ^(1/2) a constant long long (may be pre-computed) |
|||
**Instruction: <code>longlong1 = 17179800000LL^(1/2);</code> |
|||
**Time: 188 ns |
|||
*Test (d): Time required to ^(1/2) a random long long (guaranteed not to be pre-comp) |
|||
**Instruction: <code>random_longlong = random_longlong2^(1/2);</code> |
|||
**Time: 74 ns |
|||
==== Sine (Test 28) ==== |
|||
{{multiple image |
|||
| width = 100 |
|||
| image1 = Test28-parta.png |
|||
| alt1 = Test (a): Time required to take the sine of a constant long long |
|||
| caption1 = Test (a): Time required to take the sine of a constant long long |
|||
| image2 = Test28-partb.png |
|||
| alt2 = Test (b): Time required to take the sine of a random long long |
|||
| caption2 = Test (b): Time required to take the sine of a random long long |
|||
}} |
|||
This test determines the length of time required by the PIC32 chip to get the sine of one 64-bit number (a long long). |
|||
*Test (a): Time required to take the sine of a constant long long (may be pre-computed) |
|||
**Instruction: <code>longlong1 = sin(1347433747);</code> |
|||
**Time: 23837 ns |
|||
*Test (b): Time required to take the sine of a random long long (guaranteed not to be pre-comp) |
|||
**Instruction: <code>random_longlong = sin(random_longlong1);</code> |
|||
**Time: 29898 ns |
|||
==== Cosine (Test 29) ==== |
|||
{{multiple image |
|||
| width = 100 |
|||
| image1 = Test29-parta.png |
|||
| alt1 = Test (a): Time required to take the cosine of a constant long long |
|||
| caption1 = Test (a): Time required to take the cosine of a constant long long |
|||
| image2 = Test29-partb.png |
|||
| alt2 = Test (b): Time required to take the cosine of a random long long |
|||
| caption2 = Test (b): Time required to take the cosine of a random long long |
|||
}} |
|||
This test determines the length of time required by the PIC32 chip to get the cosine of one 64-bit number (a long long). |
|||
*Test (a): Time required to take the cosine of a constant long long (may be pre-computed) |
|||
**Instruction: <code>longlong1 = cos(1347433747);</code> |
|||
**Time: 24611 ns |
|||
*Test (b): Time required to take the cosine of a random long long (guaranteed not to be pre-comp) |
|||
**Instruction: <code>random_longlong = cos(random_longlong1);</code> |
|||
**Time: 30623 ns |
|||
===float Performance=== |
|||
---- |
|||
A float data type, in ANSI C, is a single-precision value represented by 4 bytes, consisting of a sign bit, an 8-bit excess-127 binary exponent, and a 23-bit mantissa. This roughly corresponds to a range of approximately 3.4E–38 to 3.4E+38 for unsigned floats. Signed floats are more complex to understand, but are covered in depth here: http://msdn.microsoft.com/en-us/library/hd7199ke%28VS.80%29.aspx. Given that there are many different ways to perform a given operation on a float, we've done our best to include several different methods that we feel are representative of normal coding practices. Depending on how the source code is compiled, these different methods may or may not produce different results. Furthermore, unless otherwise noted, all (a) benchmarks are operations on two predefined (and mot likely pre-computed) floats and all (b) benchmarks are operations on two random (and most likely '''not''' pre-computed) floats. These multiple benchmarks per test exist to illustrate the differences in execution time between pre-compiled operations that the compiler may have optimized ahead of time and operations the PIC must perform in real time. |
|||
==== Subtraction (Test 30) ==== |
|||
{{multiple image |
|||
| width = 100 |
|||
| image1 = Test30-parta.png |
|||
| alt1 = Test (a): Time required to subtract two constant floats |
|||
| caption1 = Test (a): Time required to subtract two constant floats |
|||
| image2 = Test30-partb.png |
|||
| alt2 = Test (b): Time required to subtract two random floats |
|||
| caption2 = Test (b): Time required to subtract two random floats |
|||
}} |
|||
This test determines the length of time required by the PIC32 chip to subtract one 32-bit number (a float) from another 32-bit number (a float). |
|||
*Test (a): Time required to subtract two constant floats (may be pre-computed) |
|||
**Instruction: <code>float1 = 1347433747.0-7889.0;</code> |
|||
**Time: 100 ns |
|||
*Test (b): Time required to subtract two random floats (guaranteed not to be pre-comp) |
|||
**Instruction: <code>random_float = larger-smaller;</code> |
|||
**Time: 900 ns |
|||
==== Addition (Test 31) ==== |
|||
{{multiple image |
|||
| width = 100 |
|||
| image1 = Test31-parta.png |
|||
| alt1 = Test (a): Time required to add two constant floats |
|||
| caption1 = Test (a): Time required to add two constant floats |
|||
| image2 = Test31-partb.png |
|||
| alt2 = Test (b): Time required to add two random floats |
|||
| caption2 = Test (b): Time required to add two random floats |
|||
}} |
|||
This test determines the length of time required by the PIC32 chip to add one 32-bit number (a float) to another 32-bit number (a float). |
|||
*Test (a): Time required to add two constant floats (may be pre-computed) |
|||
**Instruction: <code>float1 = 234232397.0+12353235.0;</code> |
|||
**Time: 124 ns |
|||
*Test (b): Time required to add two random floats (guaranteed not to be pre-comp) |
|||
**Instruction: <code>random_float = random_float1+random_float2;</code> |
|||
**Time: 1024 ns |
|||
==== Multiplication (Test 32) ==== |
|||
{{multiple image |
|||
| width = 100 |
|||
| image1 = Test32-parta.png |
|||
| alt1 = Test (a): Time required to multiply two constant floats |
|||
| caption1 = Test (a): Time required to multiply two constant floats |
|||
| image2 = Test32-partb.png |
|||
| alt2 = Test (b): Time required to multiply two random floats |
|||
| caption2 = Test (b): Time required to multiply two random floats |
|||
}} |
|||
This test determines the length of time required by the PIC32 chip to multiply one 32-bit number (a float) by another 32-bit number (a float). |
|||
*Test (a): Time required to multiply two constant floats (may be pre-computed) |
|||
**Instruction: <code>float1 = 65500.0*650.0;</code> |
|||
**Time: 124 ns |
|||
*Test (b): Time required to multiply two random floats (guaranteed not to be pre-comp) |
|||
**Instruction: <code>random_float = random_float1*random_float2;</code> |
|||
**Time: 736 ns |
|||
==== Division (Test 33) ==== |
|||
{{multiple image |
|||
| width = 100 |
|||
| image1 = Test33-parta.png |
|||
| alt1 = Test (a): Time required to divide two constant floats |
|||
| caption1 = Test (a): Time required to divide two constant floats |
|||
| image2 = Test33-partb.png |
|||
| alt2 = Test (b): Time required to divide two random floats |
|||
| caption2 = Test (b): Time required to divide two random floats |
|||
}} |
|||
This test determines the length of time required by the PIC32 chip to divide one 32-bit number (a float) by another 32-bit number (a float). |
|||
*Test (a): Time required to divide two constant floats (may be pre-computed) |
|||
**Instruction: <code>float1 = 1347433747.0/7889.0;</code> |
|||
**Time: 99 ns |
|||
*Test (b): Time required to divide two random floats (guaranteed not to be pre-comp) |
|||
**Instruction: <code>random_float = larger/smaller;</code> |
|||
**Time: 1674 ns |
|||
==== Square Root (Test 34) ==== |
|||
{{multiple image |
|||
| width = 100 |
|||
| image1 = Test34-parta.png |
|||
| alt1 = Test (a): Time required to sqrt() a constant float |
|||
| caption1 = Test (a): Time required to sqrt() a constant float |
|||
| image2 = Test34-partb.png |
|||
| alt2 = Test (b): Time required to sqrt() a random float |
|||
| caption2 = Test (b): Time required to sqrt() a random float |
|||
}} |
|||
This test determines the length of time required by the PIC32 chip to get the square root of one 32-bit number (a float). Tests (a) and (b) use the 'sqrt()' method, while tests (c) and (d) use a number to the 1/2 power. |
|||
*Test (a): Time required to sqrt() a constant float (may be pre-computed) |
|||
**Instruction: <code>float1 = sqrt(1347433747.0);</code> |
|||
**Time: 99 ns |
|||
*Test (b): Time required to sqrt() a random float (guaranteed not to be pre-comp) |
|||
**Instruction: <code>random_float = sqrt(random_float1);</code> |
|||
**Time: 8636 ns |
|||
*Test (c): Time required to ^(1/2) a constant float (may be pre-computed) |
|||
**Instruction: |
|||
**Time: N/A ns* |
|||
*Test (d): Time required to ^(1/2) a random float (guaranteed not to be pre-comp) |
|||
**Instruction: |
|||
**Time: N/A ns* |
|||
<nowiki>* We had some difficulty in trying to test these particular operations. It seems that the PIC32 is not capable of raising floats to the 1/2 power. </nowiki> |
|||
==== Sine (Test 35) ==== |
|||
{{multiple image |
|||
| width = 100 |
|||
| image1 = Test35-parta.png |
|||
| alt1 = Test (a): Time required to take the sine of a constant float |
|||
| caption1 = Test (a): Time required to take the sine of a constant float |
|||
| image2 = Test35-partb.png |
|||
| alt2 = Test (b): Time required to take the sine of a random float |
|||
| caption2 = Test (b): Time required to take the sine of a random float |
|||
}} |
|||
This test determines the length of time required by the PIC32 chip to get the sine of one 32-bit number (a float). |
|||
*Test (a): Time required to take the sine of a constant float (may be pre-computed) |
|||
**Instruction: <code>float1 = sin(1347433747.0);</code> |
|||
**Time: 19574 ns |
|||
*Test (b): Time required to take the sine of a random float (guaranteed not to be pre-comp) |
|||
**Instruction: <code>random_float = sin(random_float1);</code> |
|||
**Time: 19562 ns |
|||
==== Cosine (Test 36) ==== |
|||
{{multiple image |
|||
| width = 100 |
|||
| image1 = Test36-parta.png |
|||
| alt1 = Test (a): Time required to take the cosine of a constant float |
|||
| caption1 = Test (a): Time required to take the cosine of a constant float |
|||
| image2 = Test36-partb.png |
|||
| alt2 = Test (b): Time required to take the cosine of a random float |
|||
| caption2 = Test (b): Time required to take the cosine of a random float |
|||
}} |
|||
This test determines the length of time required by the PIC32 chip to get the cosine of one 32-bit number (a float). |
|||
*Test (a): Time required to take the cosine of a constant float (may be pre-computed) |
|||
**Instruction: <code>float1 = cos(1347433747.0);</code> |
|||
**Time: 20311 ns |
|||
*Test (b): Time required to take the cosine of a random float (guaranteed not to be pre-comp) |
|||
**Instruction: <code>random_float = cos(random_float1);</code> |
|||
**Time: 20297 ns |
|||
===double Performance=== |
|||
---- |
|||
A double data type, in ANSI C, is a double-precision value represented by 8 bytes, consisting of a sign bit, an 8-bit excess-1023 binary exponent, and a 52-bit mantissa. This roughly corresponds to a range of approximately 2.22E–308 1.79E+308 for unsigned doubles. Signed doubles are more complex, but are covered in depth here: http://msdn.microsoft.com/en-us/library/hd7199ke%28VS.80%29.aspx. Given that there are many different ways to perform a given operation on a double, we've done our best to include several different methods that we feel are representative of normal coding practices. Depending on how the source code is compiled, these different methods may or may not produce different results. Furthermore, unless otherwise noted, all (a) benchmarks are operations on two predefined (and mot likely pre-computed) doubles and all (b) benchmarks are operations on two random (and most likely '''not''' pre-computed) doubles. These multiple benchmarks per test exist to illustrate the differences in execution time between pre-compiled operations that the compiler may have optimized ahead of time and operations the PIC must perform in real time. |
|||
==== Subtraction (Test 37) ==== |
|||
{{multiple image |
|||
| width = 100 |
|||
| image1 = Test37-parta.png |
|||
| alt1 = Test (a): Time required to subtract two constant doubles |
|||
| caption1 = Test (a): Time required to subtract two constant doubles |
|||
| image2 = Test37-partb.png |
|||
| alt2 = Test (b): Time required to subtract of two random doubles |
|||
| caption2 = Test (b): Time required to subtract of two random doubles |
|||
}} |
|||
This test determines the length of time required by the PIC32 chip to subtract one 64-bit number (a double) from another 8-bit number (a double). |
|||
* Test (a): Time required to subtract two constant doubles (may be pre-computed) |
|||
** Instruction: <code>double1 = 17179800000.0-7179869184.0;</code> |
|||
** Time: 199 ns |
|||
* Test (b): Time required to subtract of two random doubles (guaranteed not to be pre-comp) |
|||
** Instruction: <code>random_double = larger-smaller;</code>* |
|||
** Time: 112 ns |
|||
<nowiki>*</nowiki>See Code section for a more details on how randomness was guaranteed. |
|||
==== Addition (Test 38) ==== |
|||
{{multiple image |
|||
| width = 100 |
|||
| image1 = Test38-parta.png |
|||
| alt1 = Test (a): Time required to add two constant doubles |
|||
| caption1 = Test (a): Time required to add two constant doubles |
|||
| image2 = Test38-partb.png |
|||
| alt2 = Test (b): Time required to add of two random doubles |
|||
| caption2 = Test (b): Time required to add of two random doubles |
|||
}} |
|||
This test determines the length of time required by the PIC32 chip to add one 64-bit number (a double) to another 64-bit number (a double). |
|||
* Test (a): Time required to add two constant doubles (may be pre-computed) |
|||
** Instruction: <code>double1 = 17179800000.0+7179869184.0;</code> |
|||
** Time: 199 ns |
|||
* Test (b): Time required to add of two random doubles (guaranteed not to be pre-comp) |
|||
** Instruction: <code>random_double = larger+smaller;</code>* |
|||
** Time: 1236 ns |
|||
<nowiki>*</nowiki>See Code section for a more details on how randomness was guaranteed. |
|||
==== Multiplication (Test 39) ==== |
|||
{{multiple image |
|||
| width = 100 |
|||
| image1 = Test39-parta.png |
|||
| alt1 = Test (a): Time required to multiply two constant doubles |
|||
| caption1 = Test (a): Time required to multiply two constant doubles |
|||
| image2 = Test39-partb.png |
|||
| alt2 = Test (b): Time required to multiply of two random doubles |
|||
| caption2 = Test (b): Time required to multiply of two random doubles |
|||
}} |
|||
This test determines the length of time required by the PIC32 chip to multiply one 64-bit number (a double) by another 64-bit number (a double). |
|||
* Test (a): Time required to multiply two constant doubles (may be pre-computed) |
|||
** Instruction: <code>double1 = 17179800.0*71798680.0;</code> |
|||
** Time: 188 ns |
|||
* Test (b): Time required to multiply of two random doubles (guaranteed not to be pre-comp) |
|||
** Instruction: <code>random_double = random_double1*random_double2;</code>* |
|||
** Time: 1438 ns |
|||
<nowiki>*</nowiki>See Code section for a more details on how randomness was guaranteed. |
|||
==== Division (Test 40) ==== |
|||
{{multiple image |
|||
| width = 100 |
|||
| image1 = Test40-parta.png |
|||
| alt1 = Test (a): Time required to divide two constant doubles |
|||
| caption1 = Test (a): Time required to divide two constant doubles |
|||
| image2 = Test40-partb.png |
|||
| alt2 = Test (b): Time required to divide of two random doubles |
|||
| caption2 = Test (b): Time required to divide of two random doubles |
|||
}} |
|||
This test determines the length of time required by the PIC32 chip to divide one 64-bit number (a double) by another 64-bit number (a double). |
|||
* Test (a): Time required to divide two constant doubles (may be pre-computed) |
|||
** Instruction: <code>double1 = 17179800.0/71798680.0;</code> |
|||
** Time: 187 ns |
|||
* Test (b): Time required to divide of two random doubles (guaranteed not to be pre-comp) |
|||
** Instruction: <code>random_double = random_double1/random_double2;</code>* |
|||
** Time: 3184 ns |
|||
<nowiki>*</nowiki>See Code section for a more details on how randomness was guaranteed. |
|||
==== Square Root (Test 41) ==== |
|||
{{multiple image |
|||
| width = 100 |
|||
| image1 = Test41-parta.png |
|||
| alt1 = Test (a): Time required to sqrt() a constant double |
|||
| caption1 = Test (a): Time required to sqrt() a constant double |
|||
| image2 = Test41-partb.png |
|||
| alt2 = Test (b): Time required to sqrt() an int cast into a double |
|||
| caption2 = Test (b): Time required to sqrt() an int cast into a double |
|||
}} |
|||
This test determines the length of time required by the PIC32 chip to square root a 64-bit number (a double). Benchmarks (a) through (c) use the <code>sqrt()</code> function while benchmarks (d) through (f) raise the operands to the 1/2 power. |
|||
* Test (a): Time required to sqrt() a constant double (may be pre-computed) |
|||
** Instruction: <code>double1 = sqrt(33359738444.0);</code> |
|||
** Time: 188 ns |
|||
* Test (b): Time required to sqrt() an int cast into a double (may be pre-computed) |
|||
** Instruction: <code>random_double = sqrt(random_double1);</code> |
|||
** Time: 7998 ns |
|||
* Test (c): Time required to sqrt() a random char (guaranteed not to be pre-comp) |
|||
** Instruction: |
|||
** Time: N/A ns * |
|||
* Test (d): Time required to ^(1/2) a constant char (may be pre-computed) |
|||
** Instruction: |
|||
** Time: N/A ns * |
|||
<nowiki>*</nowiki> We had some difficulty in trying to test these particular operations. It seems that the PIC32 is not capable of raising doubles to the 1/2 power. |
|||
==== Sine (Test 42) ==== |
|||
{{multiple image |
|||
| width = 100 |
|||
| image1 = Test42-parta.png |
|||
| alt1 = Test (a): Time required to take the sine of a constant double |
|||
| caption1 = Test (a): Time required to take the sine of a constant double |
|||
| image2 = Test42-partb.png |
|||
| alt2 = Test (b): Time required to take the sine of a random double |
|||
| caption2 = Test (b): Time required to take the sine of a random double |
|||
}} |
|||
This test determines the length of time required by the PIC32 chip to take the sine of an 64-bit number (a double). |
|||
* Test (a): Time required to take the sine of a constant doule (may be pre-computed) |
|||
** Instruction: <code>double1 = sin(33359738444.0);</code> |
|||
** Time: 20299 ns |
|||
* Test (b): Time required to take the sine of a random double (guaranteed not to be pre-comp) |
|||
** Instruction: <code>random_double = sin(random_double1);</code>* |
|||
** Time: 20624 ns |
|||
<nowiki>*</nowiki>See Code section for a more details on how randomness was guaranteed. |
|||
==== Cosine (Test 43) ==== |
|||
{{multiple image |
|||
| width = 100 |
|||
| image1 = Test43-parta.png |
|||
| alt1 = Test (a): Time required to take the cosine of a constant double |
|||
| caption1 = Test (a): Time required to take the cosine of a constant double |
|||
| image2 = Test43-partb.png |
|||
| alt2 = Test (b): Time required to take the cosine of a random double |
|||
| caption2 = Test (b): Time required to take the cosine of a random double |
|||
}} |
|||
This test determines the length of time required by the PIC32 chip to get the cosine of one 64-bit number (a double). |
|||
* Test (a): Time required to take the cosine of a constant double (may be pre-computed) |
|||
** Instruction: <code>double1 = cos(33359738444.0);</code> |
|||
** Time: 19762 ns |
|||
* Test (b): Time required to take the cosine of a random double (guaranteed not to be pre-comp) |
|||
** Instruction: <code>random_double = cos(random_double1);</code>* |
|||
** Time: 20011 ns |
|||
<nowiki>*</nowiki>See Code section for a more details on how randomness was guaranteed. |
|||
== Code == |
== Code == |
||
There is a c file for each test that was performed. They are included in a .zip at the bottom of this section. and can be downloaded for further review. Below is an example of the code, in this case, for testing multiplication of two chars. |
|||
Where possible, make it a single piece of well-commented cut-and-pastable code, or at least make each function that way, so others can easily copy it. Most comments should be in the code itself; outside the code (on the wiki) should only be explanatory comments that are too cumbersome to include in the code. |
|||
<pre> |
|||
/******************************************************************************* |
|||
* PIC32 Benchmarking Test Suite |
|||
* |
|||
* Test 4 - char Multiplication Duration |
|||
* Version 1.0 |
|||
* Copyright (C) 2010 Todd H. Poole, Katy Powers, Max Willer |
|||
* |
|||
* This test determines the length of time required by the PIC32 chip to multiply |
|||
* one 8-bit number (a char) by another 8-bit number (a char). |
|||
* |
|||
* A char data type, in ANSI C, is a value holding one byte, or one character |
|||
* code. The actual number of bits in a char in a particular implementation is |
|||
* documented as CHAR_BIT in that implementation's limits.h file. In practice, |
|||
* it is almost always 8 bits, corresponding to a decimal range of 0 to 255, |
|||
* inclusive. |
|||
* |
|||
* Given that there are many different ways to multiply one char by another, |
|||
* we've done our best to include several different multiplication methods that |
|||
* we feel are representative of normal coding practices. Depending on how the |
|||
* source code is compiled, these different methods may or may not produce |
|||
* different results. |
|||
*******************************************************************************/ |
|||
#include "HardwareProfile.h" |
|||
#define PIN_A2 LATAbits.LATA2 //Define the specific pin we'll use for our tests |
|||
char ascii_225; |
|||
char ascii_200; |
|||
char random_char; |
|||
float zero_to_1; |
|||
float zero_to_15; |
|||
int int_zero_to_15; |
|||
int random_int1; |
|||
int random_int2; |
|||
char random_char1; |
|||
char random_char2; |
|||
char larger; |
|||
char smaller; |
|||
int main(void) |
|||
{ |
|||
SYSTEMConfigPerformance(SYS_FREQ); |
|||
TRISAbits.TRISA2 = 0; //Setting our testing pin to be an output |
|||
/*Seed the random number generator so we don't get the same 'random' |
|||
* numbers over and over again. This is just to ensure that our |
|||
* C-compiler doesn't try to pull any optimization tricks on us. |
|||
*/ |
|||
srand(time(NULL)); |
|||
//Finding our first random char |
|||
/*Take the output of rand() - which is a number [0,RAND_MAX] - and |
|||
* divide that output by RAND_MAX+1. We now have a number that ranges |
|||
* from [0,1). Multiply this result by 16. This will give us a random |
|||
* number in the range of [0,16). (I chose 16 because the maximum value |
|||
* a char can be is 255. Having our range run from 0 to 16 exclusive will |
|||
* ensure that our ints - and, later, our chars - will range from [0,15]. |
|||
* Thus, regardless of what values our two ints/chars take, the final |
|||
* multiplication will fit in a char, which range from 0 to 255 in decimal). |
|||
* Cast this result into an int, which will truncate (cut off) the digits |
|||
* after the decimal. Thus: |
|||
* 0.00001 through 0.99999 will cast to 0. |
|||
* 255.00001 through 255.99999 will cast to 255. |
|||
* Then, cast our int from an int into a char. |
|||
* PS: Yes, I know this doesn't give us a uniform probability distribution, but |
|||
* we're not too concerned with true randomness... just enough to ensure nothing |
|||
* is computed ahead of time by the C-compiler. |
|||
*/ |
|||
zero_to_1 = rand() / RAND_MAX+1; //float that ranges from [0,1) |
|||
zero_to_15 = zero_to_1 * 16; //float that ranges from [0,16) |
|||
int_zero_to_15 = zero_to_15; //int that ranges from [0,15] |
|||
random_int1 = int_zero_to_15; //our first random int |
|||
random_char1 = int_zero_to_15; //our first random char |
|||
//Finding our second random char |
|||
zero_to_1 = rand() / RAND_MAX+1; //float that ranges from [0,1) |
|||
zero_to_15 = zero_to_1 * 16; //float that ranges from [0,256) |
|||
int_zero_to_15 = zero_to_15; //int that ranges from [0,255] |
|||
random_int2 = int_zero_to_15; //our second random int |
|||
random_char2 = int_zero_to_15; //out second random char |
|||
/*By forcing all of our operations to depend on the results of randomly |
|||
* generated numbers, we've guaranteed that our C-compiler won't be able to |
|||
* interfere with our benchmarks by trying to compute our results in advance. |
|||
* This ensures that all operations are performed on the fly by the PIC, and |
|||
* that it's performance will be similar to what one might encounter in other |
|||
* projects where all variables and data are not completely known in advance. |
|||
*/ |
|||
//Time to start the actual benchmarking |
|||
while(1) |
|||
{ |
|||
/*We start our while loop off like this so that we can create an easily |
|||
* recognizable pattern on the oscilloscope. We need to be able to |
|||
* differentiate between the various tests, and so, after seeing this unique |
|||
* pattern, we'll know that next test to follow will be the first test, the |
|||
* test after that will be the second, etc. |
|||
*/ |
|||
PIN_A2 = 1; |
|||
PIN_A2 = 0; |
|||
PIN_A2 = 0; |
|||
PIN_A2 = 0; |
|||
PIN_A2 = 1; |
|||
PIN_A2 = 0; |
|||
//Test 4(a) - Multiplication of Two Constant chars (may be pre-computed) |
|||
PIN_A2 = 1; |
|||
ascii_225 = 'K'*'?'; |
|||
PIN_A2 = 0; |
|||
//Test 4(b) - Multiplication of Two ints Cast into a char (may be pre-computed) |
|||
PIN_A2 = 1; |
|||
ascii_200 = 100*2; |
|||
PIN_A2 = 0; |
|||
//Test 4(c) - Mulitplication of Two Random chars (guaranteed not to be pre-comp) |
|||
PIN_A2 = 1; |
|||
random_char = larger*smaller; |
|||
PIN_A2 = 0; |
|||
PIN_A2 = 1; |
|||
} |
|||
} |
|||
</pre> |
|||
[[Media:Kp_mw_tp_ME333PIC32Benchmarking.zip]] |
Latest revision as of 14:20, 1 March 2010
Overview
We were tasked with determining the real-time cost (measured in nanoseconds) of performing seven basic mathematical operations with each one of the six commonly used ANSI C data types.
The mathematical operations we tested were:
- subtraction
- addition
- multiplication
- division
- square root
- sine
- cosine
The six data types we tested each operation on were:
- char
- short
- integer
- long long
- float
- double
Our testing procedure was simple: throw an output pin high on the NU32 development board, perform a mathematical operation with a given data type, and then pull the same pin low.
Placing the above three steps in an infinite while loop afforded us the opportunity to use an oscilloscope to measure the duration between each high-low pair in the output waveform. After subtracting the time it took for the PIC to raise and lower the voltage on the output pin (something we previously measured), we were able to determine the amount of time required for the PIC32 chip to execute an operation with a high level of accuracy.
With seven operations to perform on six different data types, we created the following table to help us assign and keep track of the various tests we planned to run:
char (8-bit) | short (16-bit) | int (32-bit) | long long (64-bit) | float (32-bit) | double (64-bit) | |
---|---|---|---|---|---|---|
subtraction | Test 2 | Test 9 | Test 16 | Test 23 | Test 30 | Test 37 |
addition | Test 3 | Test 10 | Test 17 | Test 24 | Test 31 | Test 38 |
multiplication | Test 4 | Test 11 | Test 18 | Test 25 | Test 32 | Test 39 |
division | Test 5 | Test 12 | Test 19 | Test 26 | Test 33 | Test 40 |
square root | Test 6 | Test 13 | Test 20 | Test 27 | Test 34 | Test 41 |
sine | Test 7 | Test 14 | Test 21 | Test 28 | Test 35 | Test 42 |
cosine | Test 8 | Test 15 | Test 22 | Test 29 | Test 36 | Test 43 |
Several tests contained multiple procedures that explored various ways to carry out a given mathematical operation on a given data type. For example, in the multiplication tests, not only did we test the traditional multiplication operator (*), but also the bitwise left shift operator (<<). Our goal was to find out if one particular operator was faster than the other. Similarly, we also included procedures that performed the above operations on hard-coded numbers (such as 347) as well as randomly chosen numbers stored in variables (such as 'random_int1'). We wanted to ensure that the compiler didn't compute the results of each operation in advance. While pre-compiling can indeed afford welcome increases in execution time, situations in which the compiler can't optimize the operations ahead of time (for example, situations where the data to be operated on is not known in advance) are still common occurrences and are worth benchmarking.
Accordingly, several tests contain multiple procedures that not only account for multiple methods of performing a particular operation, but multiple sets of numbers to perform those operations on.
Test 1 was used to determine the duration required for the PIC32 to throw a pin high and pull a pin low, while Tests 2 through 43 were used to measure the actual performance of each operation and data-type pair.
Circuit
The circuit used for testing consisted of connecting the oscilloscope to the output pin (in this case, pin A2) in order to view the waveform.
Results
Below is the quick summary of the testing results comparing each data type and each operation. All results are normalized to 60 ns (1.00 = 60 ns).
Subtraction | Addition | Multiplication | Division | Square Root | Sine | Cosine | ||
---|---|---|---|---|---|---|---|---|
Char | 1.87 | 1.65 | 2.28 | N/A | 34.78 | 116.03 | 98.93 | |
Short | 1.03 | 1.67 | 1.47 | 5.00 | 144.57 | 230.40 | 215.40 | |
Int | 1.07 | 1.00 | 1.43 | 8.10 | 145.62 | 316.43 | 330.62 | |
Long Long | 2.50 | 3.33 | 6.63 | 28.73 | 271.85 | 498.30 | 510.38 | |
Float | 15.00 | 17.07 | 12.27 | 27.90 | 143.93 | 326.03 | 338.28 | |
Double | 26.00 | 20.60 | 23.97 | 53.07 | 133.30 | 343.73 | 333.52 |
Here is the Excel file with the full results, in nanoseconds and normalized. Media:Lab5.xls
Below are the results of each particular test we performed, coupled with a short explanation for each result.
Basic Timing Constants (Test 1)
This test determines the length of time required by the PIC32 chip to push a given output pin high and pull the same pin low.
- Test (a): Time required to throw an output pin high
- Instruction:
PIN_A2 = 1;
- Time: 63 ns
- Instruction:
- Test (b): Time required to pull an output pin low
- Instruction:
PIN_A2 = 0;
- Time: 63 ns
- Instruction:
- Test (c): Time required to execute 1 empty while loop cycle
- Instruction:
while(1){}
- Time: 23 ns
- Instruction:
char Performance
A char
data type, in ANSI C, is a value holding one byte, or one character code. The actual number of bits in a char in a particular implementation is documented as CHAR_BIT in that implementation's limits.h
file. In practice, it is almost always 8 bits, corresponding to a decimal range of 0 to 255 inclusive. Given that there are many different ways to perform a given operation on a char, we've done our best to include several different methods that we feel are representative of normal coding practices. Depending on how the source code is compiled, these different methods may or may not produce different results. Furthermore, unless otherwise noted, all (a) benchmarks are operations on two predefined (and mot likely pre-computed) ASCII letters, all (b) benchmarks are operations on two predefined (and most likely pre-computed) numbers in the range of 0 to 255, and all (c) benchmarks are operations on two random (and most likely not pre-computed) numbers. These multiple benchmarks per test exist to illustrate the differences in execution time between pre-compiled operations that the compiler may have optimized ahead of time and operations the PIC must perform in real time.
Subtraction (Test 2)
This test determines the length of time required by the PIC32 chip to subtract one 8-bit number (a char) from another 8-bit number (a char).
- Test (a): Time required to subtract two constant chars (may be pre-computed)
- Instruction:
letter_capital_a = 'z'-'7';
- Time: 50 ns
- Instruction:
- Test (b): Time required to subtract two ints cast into a char (may be pre-computed)
- Instruction:
letter_b = 100-2;
- Time: 50 ns
- Instruction:
- Test (c): Time required to subtract two random chars (guaranteed not to be pre-comp)
- Instruction:
random_char = larger-smaller;
* - Time: 112 ns
- Instruction:
*See Code section for a more details on how randomness was guaranteed.
Addition (Test 3)
This test determines the length of time required by the PIC32 chip to add one 8-bit number (a char) to another 8-bit number (a char).
- Test (a): Time required to add two constant chars (may be pre-computed)
- Instruction:
letter_a = ')'+'8';
- Time: 50 ns
- Instruction:
- Test (b): Time required to add two ints cast into a char (may be pre-computed)
- Instruction:
letter_b = 97+1;
- Time: 50 ns
- Instruction:
- Test (c): Time required to add two random chars (guaranteed not to be pre-comp)
- Instruction:
random_char = random_char1+random_char2;
* - Time: 99 ns
- Instruction:
*See Code section for a more details on how randomness was guaranteed.
Multiplication (Test 4)
This test determines the length of time required by the PIC32 chip to multiply one 8-bit number (a char) by another 8-bit number (a char).
- Test (a): Time required to multiply two constant chars (may be pre-computed)
- Instruction:
ascii_225 = 'K'*'♥';
- Time: 49 ns
- Instruction:
- Test (b): Time required to multiply two ints cast into a char (may be pre-computed)
- Instruction:
ascii_200 = 100*2;
- Time: 48 ns
- Instruction:
- Test (c): Time required to multiply of two random chars (guaranteed not to be pre-comp)
- Instruction:
random_char = larger*smaller;
* - Time: 137 ns
- Instruction:
*See Code section for a more details on how randomness was guaranteed.
Division (Test 5)
This test determines the length of time required by the PIC32 chip to divide one 8-bit number (a char) by another 8-bit number (a char).
- Test (a): Time required to divide two constant chars (may be pre-computed)
- Instruction:
ascii_25 = 'K'/'♥'; //thp
- Time: 48 ns
- Instruction:
- Test (b): Time required to divide two ints cast into a char (may be pre-computed)
- Instruction:
letter_2 = 100/2;
- Time: 50 ns
- Instruction:
- Test (c): Time required to divide two random chars (guaranteed not to be pre-comp)
- Instruction:
random_char = larger/smaller;
* - Time: N/A
- Instruction:
* We had great difficulty in trying to test this particular operation. After some research with an oscilloscope and voltmeter, it seems that the PIC32 is not capable of dividing chars in this way. More specifically, every time the PIC32 attempts to divide one char by another, all output pins are immediately grounded. We've tested this code in other C environments, and it works as expected, so the error must either lie somewhere within our specific PICs (which would be very unlikely - we tested 3) or the silicon architecture of the PIC32 itself (sill unlikely, but given the number of PICs we tested, more probable). If you absolutely need to divide chars, cast them to ints first, perform your division, then cast them back to chars.
Square Root (Test 6)
This test determines the length of time required by the PIC32 chip to square root one 8-bit number (a char). Benchmarks (a) through (c) use the sqrt()
function while benchmarks (d) through (f) raise the operands to the 1/2 power.
- Test (a): Time required to sqrt() a constant char (may be pre-computed)
- Instruction:
ascii_25 = sqrt('u');
- Time: 48 ns
- Instruction:
- Test (b): Time required to sqrt() an int cast into a char (may be pre-computed)
- Instruction:
number_10 = sqrt(100);
- Time: 48 ns
- Instruction:
- Test (c): Time required to sqrt() a random char (guaranteed not to be pre-comp)
- Instruction:
random_char = sqrt(random_char1);
- Time: 2087 ns
- Instruction:
- Test (d): Time required to ^(1/2) a constant char (may be pre-computed)
- Instruction:
ascii_25 = ('u')^(1/2);
- Time: 48 ns
- Instruction:
- Test (e): Time required to ^(1/2) an int cast into a char (may be pre-computed)
- Instruction:
number_10 = (100)^(1/2);
- Time: 48 ns
- Instruction:
- Test (f): Time required to ^(1/2) a random char (guaranteed not to be pre-comp)
- Instruction:
random_char = (random_char2)^(1/2);
* - Time: 75 ns
- Instruction:
*See Code section for a more details on how randomness was guaranteed.
Sine (Test 7)
This test determines the length of time required by the PIC32 chip to take the sine of an 8-bit number (a char).
- Test (a): Time required to take the sine of a constant char (may be pre-computed)
- Instruction:
ascii_25 = sin('K');
- Time: 9963 ns
- Instruction:
- Test (b): Time required to take the sine of a constant int cast into a char (may be pre-computed)
- Instruction:
letter_2 = sin(50);
- Time: 9550 ns
- Instruction:
- Test (c): Time required to take the sine of a random char (guaranteed not to be pre-comp)
- Instruction:
random_char = sin(larger);
* - Time: 6962 ns
- Instruction:
*See Code section for a more details on how randomness was guaranteed.
Cosine (Test 8)
This test determines the length of time required by the PIC32 chip to take the cosine of an 8-bit number (a char).
- Test (a): Time required to take the cosine of a constant char (may be pre-computed)
- Instruction:
ascii_25 = cos('K');
- Time: 9111 ns
- Instruction:
- Test (b): Time required to take the cosine of a constant int cast into a char (may be pre-computed)
- Instruction:
letter_2 = cos(50);
- Time: 8724 ns
- Instruction:
- Test (c): Time required to take the cosine of a random char (guaranteed not to be pre-comp)
- Instruction:
random_char = cos(larger);
* - Time: 5936 ns
- Instruction:
*See Code section for a more details on how randomness was guaranteed.
short Performance
A short data type, in ANSI C, is a value that holds 2 bytes, or 16 bits. This corresponds to a range of 0 to 65535 (2^16 - 1). If the variable is signed (negative), then the range is from -32767 to 32767 (-2^15 + 1 to 2^15 -1). Given that there are many different ways to perform a given operation on a short, we've done our best to include several different methods that we feel are representative of normal coding practices. Depending on how the source code is compiled, these different methods may or may not produce different results. Furthermore, unless otherwise noted, all (a) benchmarks are operations on two predefined (and mot likely pre-computed) shorts and all (b) benchmarks are operations on two random (and most likely not pre-computed) shorts. These multiple benchmarks per test exist to illustrate the differences in execution time between pre-compiled operations that the compiler may have optimized ahead of time and operations the PIC must perform in real time.
Subtraction (Test 9)
This test determines the length of time required by the PIC32 chip to subtract one 16-bit number (a short) from another 16-bit number (a short).
- Test (a): Time required to subtract two constant shorts (may be pre-computed)
- Instruction:
short1 = 1337-343;
- Time: 25 ns
- Instruction:
- Test (b): Time required to subtract two random shorts (guaranteed not to be pre-comp)
- Instruction:
random_short = larger-smaller;
- Time: 62 ns
- Instruction:
Addition (Test 10)
This test determines the length of time required by the PIC32 chip to add one 16-bit number (a short) to another 16-bit number (a short).
- Test (a): Time required to add two constant shorts (may be pre-computed)
- Instruction:
short1 = 1337+343;
- Time: 50 ns
- Instruction:
- Test (b): Time required to add two random shorts (guaranteed not to be pre-comp)
- Instruction:
random_short = larger-smaller;
- Time: 100 ns
- Instruction:
Multiplication (Test 11)
This test determines the length of time required by the PIC32 chip to multiply one 16-bit number (a short) by another 16-bit number (a short).
- Test (a): Time required to multiply two constant shorts (may be pre-computed)
- Instruction:
short1 = 47*347;
- Time: 24 ns
- Instruction:
- Test (b): Time required to multiply two random shorts (guaranteed not to be pre-comp)
- Instruction:
random_short = larger*smaller;
- Time: 88 ns
- Instruction:
Division (Test 12)
This test determines the length of time required by the PIC32 chip to divide one 16-bit number (a short) by another 16-bit number (a short).
- Test (a): Time required to divide two constant shorts (may be pre-computed)
- Instruction:
short1 = 62488/347;
- Time: 28 ns
- Instruction:
- Test (b): Time required to divide two random shorts (guaranteed not to be pre-comp)
- Instruction:
random_short = larger/smaller;
- Time: 300 ns
- Instruction:
Square Root (Test 13)
This test determines the length of time required by the PIC32 chip to get the square root of one 16-bit number (a short). Tests (a) and (b) use the 'sqrt()' method, while tests (c) and (d) use a number to the 1/2 power.
- Test (a): Time required to sqrt() a constant short (may be pre-computed)
- Instruction:
short1 = sqrt(31337);
- Time: 50 ns
- Instruction:
- Test (b): Time required to sqrt() a random short (guaranteed not to be pre-comp)*
- Instruction:
short2 = sqrt(random_short1);
- Time: 8674 ns
- Instruction:
- Test (c): Time required to ^(1/2) a constant short (may be pre-computed)
- Instruction:
short3 = (30343)^(1/2);
- Time: 50 ns
- Instruction:
- Test (d): Time required to ^(1/2) a random short (guaranteed not to be pre-comp)*
- Instruction:
short4 = (random_short2)^(1/2);
- Time: 76 ns
- Instruction:
*See Code section for a more details on how randomness was guaranteed.
Sine (Test 14)
This test determines the length of time required by the PIC32 chip to get the sine of one 16-bit number (a short).
- Test (a): Time required to take the sine of a constant short (may be pre-computed)
- Instruction:
short1 = sin(31337);
- Time: 13014 ns
- Instruction:
- Test (b): Time required to take the sine of a random short (guaranteed not to be pre-comp)
- Instruction:
random_short = sin(random_short1);
- Time: 13824 ns
- Instruction:
Cosine (Test 15)
This test determines the length of time required by the PIC32 chip to get the cosine of one 16-bit number (a short).
- Test (a): Time required to take the cosine of a constant short (may be pre-computed)
- Instruction:
short1 = cos(31337);
- Time: 12174 ns
- Instruction:
- Test (b): Time required to take the cosine of a random short (guaranteed not to be pre-comp)
- Instruction:
random_short = cos(random_short1);
- Time: 12924 ns
- Instruction:
int Performance
An int data type, in ANSI C, is a value that holds 4 bytes, or 32 bits. This corresponds to a range of 0 to 4294967295 (2^32 - 1). If the variable is signed (negative), then the range is from -2147483647 to 2147483647 (-2^31 + 1 to 2^31 -1). Given that there are many different ways to perform a given operation on an int, we've done our best to include several different methods that we feel are representative of normal coding practices. Depending on how the source code is compiled, these different methods may or may not produce different results. Furthermore, unless otherwise noted, all (a) benchmarks are operations on two predefined (and mot likely pre-computed) ints and all (b) benchmarks are operations on two random (and most likely not pre-computed) ints. These multiple benchmarks per test exist to illustrate the differences in execution time between pre-compiled operations that the compiler may have optimized ahead of time and operations the PIC must perform in real time.
Subtraction (Test 16)
This test determines the length of time required by the PIC32 chip to subtract one 32-bit number (an int) from another 32-bit number (an int).
- Test (a): Time required to subtract two constant ints (may be pre-computed)
- Instruction:
int1 = 2271988-7889;
- Time: 38 ns
- Instruction:
- Test (b): Time required to subtract two random ints (guaranteed not to be pre-comp)
- Instruction:
random_int = larger-smaller;
- Time: 64 ns
- Instruction:
Addition (Test 17)
This test determines the length of time required by the PIC32 chip to add one 32-bit number (an int) to another 32-bit number (an int).
- Test (a): Time required to add two constant ints (may be pre-computed)
- Instruction:
- Time: 26 ns
- Test (b): Time required to add two random ints (guaranteed not to be pre-comp)
- Instructions:
- Time: 60 ns
Multiplication (Test 18)
This test determines the length of time required by the PIC32 chip to multiply one 32-bit number (an int) by another 32-bit number (an int).
- Test (a): Time required to multiply two constant ints (may be pre-computed)
- Instruction:
int1 = 65500*6550;
- Time: 38 ns
- Instruction:
- Test (b): Time required to multiply two random ints (guaranteed not to be pre-comp)
- Instruction:
random_int = random_int1*random_int2;
- Time: 86 ns
- Instruction:
Division (Test 19)
This test determines the length of time required by the PIC32 chip to divide one 32-bit number (an int) by another 32-bit number (an int).
- Test (a): Time required to divide two constant ints (may be pre-computed)
- Instruction:
int1 = 1943438364/347;
- Time: 38 ns
- Instruction:
- Test (b): Time required to divide two random ints (guaranteed not to be pre-comp)
- Instruction:
random_int = larger/smaller;
- Time: 486 ns
- Instruction:
Square Root (Test 20)
This test determines the length of time required by the PIC32 chip to get the square root of one 32-bit number (an int). Tests (a) and (b) use the 'sqrt()' method, while tests (c) and (d) use a number to the 1/2 power.
- Test (a): Time required to sqrt() a constant int (may be pre-computed)
- Instruction:
int1 = sqrt(347343777);
- Time: 50 ns
- Instruction:
- Test (b): Time required to sqrt() a random int (guaranteed not to be pre-comp)
- Instruction:
int2 = sqrt(random_int1);
- Time: 8737 ns
- Instruction:
- Test (c): Time required to ^(1/2) a constant int (may be pre-computed)
- Instruction:
int3 = (743347343)^(1/2);
- Time: 88 ns
- Instruction:
- Test (d): Time required to ^(1/2) a random int (guaranteed not to be pre-comp)
- Instruction:
int4 = (random_int2)^(1/2);
- Time: 74 ns
- Instruction:
Sine (Test 21)
This test determines the length of time required by the PIC32 chip to get the sine of one 32-bit number (an int).
- Test (a): Time required to take the sine of a constant int (may be pre-computed)
- Instruction:
int1 = sin(1347433747);
- Time: 19488 ns
- Instruction:
- Test (b): Time required to take the sine of a random int (guaranteed not to be pre-comp)
- Instruction:
random_int = sin(random_int1);
- Time: 18988 ns
- Instruction:
Cosine (Test 22)
This test determines the length of time required by the PIC32 chip to get the cosine of one 32-bit number (an int).
- Test (a): Time required to take the cosine of a constant int (may be pre-computed)
- Instruction:
int1 = cos(1347433747);
- Time: 20324 ns
- Instruction:
- Test (b): Time required to take the cosine of a random int (guaranteed not to be pre-comp)
- Instruction:
random_int = cos(random_int1);
- Time: 19837 ns
- Instruction:
long long Performance
A long long data type, in ANSI C, is a value that holds 8 bytes, or 64 bits. This corresponds to a range of 0 to 1.84467441 × 10^19 (2^64 - 1). If the variable is signed (negative), then the range is from -9.22337204 × 10^18 to 9.22337204 × 10^18 (-2^63 + 1 to 2^63 -1). Given that there are many different ways to perform a given operation on a long long, we've done our best to include several different methods that we feel are representative of normal coding practices. Depending on how the source code is compiled, these different methods may or may not produce different results. Furthermore, unless otherwise noted, all (a) benchmarks are operations on two predefined (and mot likely pre-computed) long longs and all (b) benchmarks are operations on two random (and most likely not pre-computed) long longs. These multiple benchmarks per test exist to illustrate the differences in execution time between pre-compiled operations that the compiler may have optimized ahead of time and operations the PIC must perform in real time.
Subtraction (Test 23)
This test determines the length of time required by the PIC32 chip to subtract one 64-bit number (a long long) from another 64-bit number (a long long).
- Test (a): Time required to subtract two constant long longs (may be pre-computed)
- Instruction:
longlong1 = 17179800000LL-7179869184LL;
- Time: 186 ns
- Instruction:
- Test (b): Time required to subtract two random long longs (guaranteed not to be pre-comp)
- Instruction:
random_longlong = larger-smaller;
- Time: 150 ns
- Instruction:
Addition (Test 24)
This test determines the length of time required by the PIC32 chip to add one 64-bit number (a long long) to another 64-bit number (a long long).
- Test (a): Time required to add two constant long longs (may be pre-computed)
- Instruction:
longlong1 = 17179800000LL+179869184LL;
- Time: 88 ns
- Instruction:
- Test (b): Time required to add two random long longs (guaranteed not to be pre-comp)
- Instruction:
random_longlong = larger+smaller;
- Time: 200 ns
- Instruction:
Multiplication (Test 25)
This test determines the length of time required by the PIC32 chip to multiply one 64-bit number (a long long) by another 64-bit number (a long long).
- Test (a): Time required to multiply two constant long longs (may be pre-computed)
- Instruction:
longlong1 = 171798000LL*7186984LL;
- Time: 74 ns
- Instruction:
- Test (b): Time required to multiply two random long longs (guaranteed not to be pre-comp)
- Instruction:
random_longlong = larger*smaller;
- Time: 398 ns
- Instruction:
Division (Test 26)
This test determines the length of time required by the PIC32 chip to divide one 64-bit number (a long long) by another 64-bit number (a long long).
- Test (a): Time required to divide two constant long longs (may be pre-computed)
- Instruction:
longlong1 = 17179800000LL/7179869184LL;
- Time: 74 ns
- Instruction:
- Test (b): Time required to divide two random long longs (guaranteed not to be pre-comp)
- Instruction:
random_longlong = larger/smaller;
- Time: 1724 ns
- Instruction:
Square Root (Test 27)
This test determines the length of time required by the PIC32 chip to get the square root of one 64-bit number (a long long). Tests (a) and (b) use the 'sqrt()' method, while tests (c) and (d) use a number to the 1/2 power.
- Test (a): Time required to sqrt() a constant long long (may be pre-computed)
- Instruction:
longlong1 = sqrt(17179800000LL);
- Time: 87 ns
- Instruction:
- Test (b): Time required to sqrt() a random long long (guaranteed not to be pre-comp)
- Instruction:
random_longlong = sqrt(random_longlong1);
- Time: 16311 ns
- Instruction:
- Test (c): Time required to ^(1/2) a constant long long (may be pre-computed)
- Instruction:
longlong1 = 17179800000LL^(1/2);
- Time: 188 ns
- Instruction:
- Test (d): Time required to ^(1/2) a random long long (guaranteed not to be pre-comp)
- Instruction:
random_longlong = random_longlong2^(1/2);
- Time: 74 ns
- Instruction:
Sine (Test 28)
This test determines the length of time required by the PIC32 chip to get the sine of one 64-bit number (a long long).
- Test (a): Time required to take the sine of a constant long long (may be pre-computed)
- Instruction:
longlong1 = sin(1347433747);
- Time: 23837 ns
- Instruction:
- Test (b): Time required to take the sine of a random long long (guaranteed not to be pre-comp)
- Instruction:
random_longlong = sin(random_longlong1);
- Time: 29898 ns
- Instruction:
Cosine (Test 29)
This test determines the length of time required by the PIC32 chip to get the cosine of one 64-bit number (a long long).
- Test (a): Time required to take the cosine of a constant long long (may be pre-computed)
- Instruction:
longlong1 = cos(1347433747);
- Time: 24611 ns
- Instruction:
- Test (b): Time required to take the cosine of a random long long (guaranteed not to be pre-comp)
- Instruction:
random_longlong = cos(random_longlong1);
- Time: 30623 ns
- Instruction:
float Performance
A float data type, in ANSI C, is a single-precision value represented by 4 bytes, consisting of a sign bit, an 8-bit excess-127 binary exponent, and a 23-bit mantissa. This roughly corresponds to a range of approximately 3.4E–38 to 3.4E+38 for unsigned floats. Signed floats are more complex to understand, but are covered in depth here: http://msdn.microsoft.com/en-us/library/hd7199ke%28VS.80%29.aspx. Given that there are many different ways to perform a given operation on a float, we've done our best to include several different methods that we feel are representative of normal coding practices. Depending on how the source code is compiled, these different methods may or may not produce different results. Furthermore, unless otherwise noted, all (a) benchmarks are operations on two predefined (and mot likely pre-computed) floats and all (b) benchmarks are operations on two random (and most likely not pre-computed) floats. These multiple benchmarks per test exist to illustrate the differences in execution time between pre-compiled operations that the compiler may have optimized ahead of time and operations the PIC must perform in real time.
Subtraction (Test 30)
This test determines the length of time required by the PIC32 chip to subtract one 32-bit number (a float) from another 32-bit number (a float).
- Test (a): Time required to subtract two constant floats (may be pre-computed)
- Instruction:
float1 = 1347433747.0-7889.0;
- Time: 100 ns
- Instruction:
- Test (b): Time required to subtract two random floats (guaranteed not to be pre-comp)
- Instruction:
random_float = larger-smaller;
- Time: 900 ns
- Instruction:
Addition (Test 31)
This test determines the length of time required by the PIC32 chip to add one 32-bit number (a float) to another 32-bit number (a float).
- Test (a): Time required to add two constant floats (may be pre-computed)
- Instruction:
float1 = 234232397.0+12353235.0;
- Time: 124 ns
- Instruction:
- Test (b): Time required to add two random floats (guaranteed not to be pre-comp)
- Instruction:
random_float = random_float1+random_float2;
- Time: 1024 ns
- Instruction:
Multiplication (Test 32)
This test determines the length of time required by the PIC32 chip to multiply one 32-bit number (a float) by another 32-bit number (a float).
- Test (a): Time required to multiply two constant floats (may be pre-computed)
- Instruction:
float1 = 65500.0*650.0;
- Time: 124 ns
- Instruction:
- Test (b): Time required to multiply two random floats (guaranteed not to be pre-comp)
- Instruction:
random_float = random_float1*random_float2;
- Time: 736 ns
- Instruction:
Division (Test 33)
This test determines the length of time required by the PIC32 chip to divide one 32-bit number (a float) by another 32-bit number (a float).
- Test (a): Time required to divide two constant floats (may be pre-computed)
- Instruction:
float1 = 1347433747.0/7889.0;
- Time: 99 ns
- Instruction:
- Test (b): Time required to divide two random floats (guaranteed not to be pre-comp)
- Instruction:
random_float = larger/smaller;
- Time: 1674 ns
- Instruction:
Square Root (Test 34)
This test determines the length of time required by the PIC32 chip to get the square root of one 32-bit number (a float). Tests (a) and (b) use the 'sqrt()' method, while tests (c) and (d) use a number to the 1/2 power.
- Test (a): Time required to sqrt() a constant float (may be pre-computed)
- Instruction:
float1 = sqrt(1347433747.0);
- Time: 99 ns
- Instruction:
- Test (b): Time required to sqrt() a random float (guaranteed not to be pre-comp)
- Instruction:
random_float = sqrt(random_float1);
- Time: 8636 ns
- Instruction:
- Test (c): Time required to ^(1/2) a constant float (may be pre-computed)
- Instruction:
- Time: N/A ns*
- Test (d): Time required to ^(1/2) a random float (guaranteed not to be pre-comp)
- Instruction:
- Time: N/A ns*
* We had some difficulty in trying to test these particular operations. It seems that the PIC32 is not capable of raising floats to the 1/2 power.
Sine (Test 35)
This test determines the length of time required by the PIC32 chip to get the sine of one 32-bit number (a float).
- Test (a): Time required to take the sine of a constant float (may be pre-computed)
- Instruction:
float1 = sin(1347433747.0);
- Time: 19574 ns
- Instruction:
- Test (b): Time required to take the sine of a random float (guaranteed not to be pre-comp)
- Instruction:
random_float = sin(random_float1);
- Time: 19562 ns
- Instruction:
Cosine (Test 36)
This test determines the length of time required by the PIC32 chip to get the cosine of one 32-bit number (a float).
- Test (a): Time required to take the cosine of a constant float (may be pre-computed)
- Instruction:
float1 = cos(1347433747.0);
- Time: 20311 ns
- Instruction:
- Test (b): Time required to take the cosine of a random float (guaranteed not to be pre-comp)
- Instruction:
random_float = cos(random_float1);
- Time: 20297 ns
- Instruction:
double Performance
A double data type, in ANSI C, is a double-precision value represented by 8 bytes, consisting of a sign bit, an 8-bit excess-1023 binary exponent, and a 52-bit mantissa. This roughly corresponds to a range of approximately 2.22E–308 1.79E+308 for unsigned doubles. Signed doubles are more complex, but are covered in depth here: http://msdn.microsoft.com/en-us/library/hd7199ke%28VS.80%29.aspx. Given that there are many different ways to perform a given operation on a double, we've done our best to include several different methods that we feel are representative of normal coding practices. Depending on how the source code is compiled, these different methods may or may not produce different results. Furthermore, unless otherwise noted, all (a) benchmarks are operations on two predefined (and mot likely pre-computed) doubles and all (b) benchmarks are operations on two random (and most likely not pre-computed) doubles. These multiple benchmarks per test exist to illustrate the differences in execution time between pre-compiled operations that the compiler may have optimized ahead of time and operations the PIC must perform in real time.
Subtraction (Test 37)
This test determines the length of time required by the PIC32 chip to subtract one 64-bit number (a double) from another 8-bit number (a double).
- Test (a): Time required to subtract two constant doubles (may be pre-computed)
- Instruction:
double1 = 17179800000.0-7179869184.0;
- Time: 199 ns
- Instruction:
- Test (b): Time required to subtract of two random doubles (guaranteed not to be pre-comp)
- Instruction:
random_double = larger-smaller;
* - Time: 112 ns
- Instruction:
*See Code section for a more details on how randomness was guaranteed.
Addition (Test 38)
This test determines the length of time required by the PIC32 chip to add one 64-bit number (a double) to another 64-bit number (a double).
- Test (a): Time required to add two constant doubles (may be pre-computed)
- Instruction:
double1 = 17179800000.0+7179869184.0;
- Time: 199 ns
- Instruction:
- Test (b): Time required to add of two random doubles (guaranteed not to be pre-comp)
- Instruction:
random_double = larger+smaller;
* - Time: 1236 ns
- Instruction:
*See Code section for a more details on how randomness was guaranteed.
Multiplication (Test 39)
This test determines the length of time required by the PIC32 chip to multiply one 64-bit number (a double) by another 64-bit number (a double).
- Test (a): Time required to multiply two constant doubles (may be pre-computed)
- Instruction:
double1 = 17179800.0*71798680.0;
- Time: 188 ns
- Instruction:
- Test (b): Time required to multiply of two random doubles (guaranteed not to be pre-comp)
- Instruction:
random_double = random_double1*random_double2;
* - Time: 1438 ns
- Instruction:
*See Code section for a more details on how randomness was guaranteed.
Division (Test 40)
This test determines the length of time required by the PIC32 chip to divide one 64-bit number (a double) by another 64-bit number (a double).
- Test (a): Time required to divide two constant doubles (may be pre-computed)
- Instruction:
double1 = 17179800.0/71798680.0;
- Time: 187 ns
- Instruction:
- Test (b): Time required to divide of two random doubles (guaranteed not to be pre-comp)
- Instruction:
random_double = random_double1/random_double2;
* - Time: 3184 ns
- Instruction:
*See Code section for a more details on how randomness was guaranteed.
Square Root (Test 41)
This test determines the length of time required by the PIC32 chip to square root a 64-bit number (a double). Benchmarks (a) through (c) use the sqrt()
function while benchmarks (d) through (f) raise the operands to the 1/2 power.
- Test (a): Time required to sqrt() a constant double (may be pre-computed)
- Instruction:
double1 = sqrt(33359738444.0);
- Time: 188 ns
- Instruction:
- Test (b): Time required to sqrt() an int cast into a double (may be pre-computed)
- Instruction:
random_double = sqrt(random_double1);
- Time: 7998 ns
- Instruction:
- Test (c): Time required to sqrt() a random char (guaranteed not to be pre-comp)
- Instruction:
- Time: N/A ns *
- Test (d): Time required to ^(1/2) a constant char (may be pre-computed)
- Instruction:
- Time: N/A ns *
* We had some difficulty in trying to test these particular operations. It seems that the PIC32 is not capable of raising doubles to the 1/2 power.
Sine (Test 42)
This test determines the length of time required by the PIC32 chip to take the sine of an 64-bit number (a double).
- Test (a): Time required to take the sine of a constant doule (may be pre-computed)
- Instruction:
double1 = sin(33359738444.0);
- Time: 20299 ns
- Instruction:
- Test (b): Time required to take the sine of a random double (guaranteed not to be pre-comp)
- Instruction:
random_double = sin(random_double1);
* - Time: 20624 ns
- Instruction:
*See Code section for a more details on how randomness was guaranteed.
Cosine (Test 43)
This test determines the length of time required by the PIC32 chip to get the cosine of one 64-bit number (a double).
- Test (a): Time required to take the cosine of a constant double (may be pre-computed)
- Instruction:
double1 = cos(33359738444.0);
- Time: 19762 ns
- Instruction:
- Test (b): Time required to take the cosine of a random double (guaranteed not to be pre-comp)
- Instruction:
random_double = cos(random_double1);
* - Time: 20011 ns
- Instruction:
*See Code section for a more details on how randomness was guaranteed.
Code
There is a c file for each test that was performed. They are included in a .zip at the bottom of this section. and can be downloaded for further review. Below is an example of the code, in this case, for testing multiplication of two chars.
/******************************************************************************* * PIC32 Benchmarking Test Suite * * Test 4 - char Multiplication Duration * Version 1.0 * Copyright (C) 2010 Todd H. Poole, Katy Powers, Max Willer * * This test determines the length of time required by the PIC32 chip to multiply * one 8-bit number (a char) by another 8-bit number (a char). * * A char data type, in ANSI C, is a value holding one byte, or one character * code. The actual number of bits in a char in a particular implementation is * documented as CHAR_BIT in that implementation's limits.h file. In practice, * it is almost always 8 bits, corresponding to a decimal range of 0 to 255, * inclusive. * * Given that there are many different ways to multiply one char by another, * we've done our best to include several different multiplication methods that * we feel are representative of normal coding practices. Depending on how the * source code is compiled, these different methods may or may not produce * different results. *******************************************************************************/ #include "HardwareProfile.h" #define PIN_A2 LATAbits.LATA2 //Define the specific pin we'll use for our tests char ascii_225; char ascii_200; char random_char; float zero_to_1; float zero_to_15; int int_zero_to_15; int random_int1; int random_int2; char random_char1; char random_char2; char larger; char smaller; int main(void) { SYSTEMConfigPerformance(SYS_FREQ); TRISAbits.TRISA2 = 0; //Setting our testing pin to be an output /*Seed the random number generator so we don't get the same 'random' * numbers over and over again. This is just to ensure that our * C-compiler doesn't try to pull any optimization tricks on us. */ srand(time(NULL)); //Finding our first random char /*Take the output of rand() - which is a number [0,RAND_MAX] - and * divide that output by RAND_MAX+1. We now have a number that ranges * from [0,1). Multiply this result by 16. This will give us a random * number in the range of [0,16). (I chose 16 because the maximum value * a char can be is 255. Having our range run from 0 to 16 exclusive will * ensure that our ints - and, later, our chars - will range from [0,15]. * Thus, regardless of what values our two ints/chars take, the final * multiplication will fit in a char, which range from 0 to 255 in decimal). * Cast this result into an int, which will truncate (cut off) the digits * after the decimal. Thus: * 0.00001 through 0.99999 will cast to 0. * 255.00001 through 255.99999 will cast to 255. * Then, cast our int from an int into a char. * PS: Yes, I know this doesn't give us a uniform probability distribution, but * we're not too concerned with true randomness... just enough to ensure nothing * is computed ahead of time by the C-compiler. */ zero_to_1 = rand() / RAND_MAX+1; //float that ranges from [0,1) zero_to_15 = zero_to_1 * 16; //float that ranges from [0,16) int_zero_to_15 = zero_to_15; //int that ranges from [0,15] random_int1 = int_zero_to_15; //our first random int random_char1 = int_zero_to_15; //our first random char //Finding our second random char zero_to_1 = rand() / RAND_MAX+1; //float that ranges from [0,1) zero_to_15 = zero_to_1 * 16; //float that ranges from [0,256) int_zero_to_15 = zero_to_15; //int that ranges from [0,255] random_int2 = int_zero_to_15; //our second random int random_char2 = int_zero_to_15; //out second random char /*By forcing all of our operations to depend on the results of randomly * generated numbers, we've guaranteed that our C-compiler won't be able to * interfere with our benchmarks by trying to compute our results in advance. * This ensures that all operations are performed on the fly by the PIC, and * that it's performance will be similar to what one might encounter in other * projects where all variables and data are not completely known in advance. */ //Time to start the actual benchmarking while(1) { /*We start our while loop off like this so that we can create an easily * recognizable pattern on the oscilloscope. We need to be able to * differentiate between the various tests, and so, after seeing this unique * pattern, we'll know that next test to follow will be the first test, the * test after that will be the second, etc. */ PIN_A2 = 1; PIN_A2 = 0; PIN_A2 = 0; PIN_A2 = 0; PIN_A2 = 1; PIN_A2 = 0; //Test 4(a) - Multiplication of Two Constant chars (may be pre-computed) PIN_A2 = 1; ascii_225 = 'K'*'?'; PIN_A2 = 0; //Test 4(b) - Multiplication of Two ints Cast into a char (may be pre-computed) PIN_A2 = 1; ascii_200 = 100*2; PIN_A2 = 0; //Test 4(c) - Mulitplication of Two Random chars (guaranteed not to be pre-comp) PIN_A2 = 1; random_char = larger*smaller; PIN_A2 = 0; PIN_A2 = 1; } }