How to learn ASM

MTK358 · 03-06-2010, 11:32 AM

That worked, now it is at the formattion stage and it is stuck at 33% with no progress for a long time, the hard drive is very active, and the CPU usage is very low. Also Debian did not let me make the partition scheme the way I want it.

I am worried that there is something wrong.

EDIT: added screenshot

theNbomr · 03-06-2010, 11:47 AM

I saw the same effect. Patience.
--- rod.

MTK358 · 03-06-2010, 12:22 PM

It finally started working.

It's installing packages now.

Sergei Steshenko · 03-06-2010, 12:58 PM

Quote:

Originally Posted by MTK358

Same error.

Please explain to me why I can't just use the ARM install CD?

Because nobody has prepared it for you.

Have you tried to choose 'Cancel' ?

MTK358 · 03-06-2010, 01:14 PM

What do you mean?

Also, in the article, I saw this:

"Debian currently does not support the ARM Versatile platform; the support will be added post-Etch"

But it already is post-Etch!

MTK358 · 03-06-2010, 02:05 PM

Another error!!!

While installing software part of installation, it came up with an error that it failed to install something. The only choice was to hit Continue, this is what happened: (See attachment)

P.S. If there only was an ARM port of Arch Linux...

smeezekitty · 03-06-2010, 04:15 PM

I found ubuntu for ARM:http://www.ubuntu.com/products/whatisubuntu/arm

Sergei Steshenko · 03-06-2010, 04:36 PM

Quote:

Originally Posted by MTK358

Another error!!!

While installing software part of installation, it came up with an error that it failed to install something. The only choice was to hit Continue, this is what happened: (See attachment)

P.S. If there only was an ARM port of Arch Linux...

That might be a QEMU (I/O) problem. I think I used pretty ancient QEMU - like 0.9.8.

...

Maybe it's a true error of your host disk I/O ? Do you see anything related to disk I/O in /var/log/messages ?

And, by the way, you do not need QEMU in a separate window, for your tasks QEMU in a console might be more convenient.

MTK358 · 03-06-2010, 05:06 PM

Quote:

Originally Posted by smeezekitty

I found ubuntu for ARM:http://www.ubuntu.com/products/whatisubuntu/arm

That seems nice. So can I just download the cd and add the option "-cdrom *ubuntu*.iso" to qemu?

Sergei Steshenko · 03-06-2010, 05:14 PM

Quote:

Originally Posted by MTK358

That seems nice. So can I just download the cd and add the option "-cdrom *ubuntu*.iso" to qemu?

Can you positively confirm that QEMU provides you with

Quote:

For Ubuntu 9.10 , the two platforms we support are:

* Freescale i.MX51 Babbage boards (2.x boards are supported and 3.0 boards are untested but expected to mostly work)
* Marvell Dove boards (Y0 and Y1 boards)

?

MTK358 · 03-06-2010, 05:21 PM

So it will only run on those?

Sergei Steshenko · 03-06-2010, 07:21 PM

Quote:

Originally Posted by MTK358

So it will only run on those?

I first give documentation benefit of the doubt. I.e. why try unless you can confirm ?

theNbomr · 03-06-2010, 08:08 PM

Quote:

Originally Posted by MTK358

Another error!!!

While installing software part of installation, it came up with an error that it failed to install something. The only choice was to hit Continue, this is what happened: (See attachment)

I just installed the QEMU-ARM system, and it took me a three tries before it worked. My feeling is that there are network errors that don't normally happen outside the emulator, as I had a few packages that it couldn't download. I said 'Continue', hoping it wouldn't matter, but evidently, it did matter. Perhaps persistence is the solution.

--- rod.

ghostdog74 · 03-06-2010, 08:38 PM

Quote:

Originally Posted by MTK358

I wanted to try learning Assembler

I have just one question for you. what is your goal in the end?

johnsfine · 03-07-2010, 11:02 AM

In case the OP, or anyone reading this thread later, wants to see the same function (raise an integer to an integer power) as an example in the way I suggest learning asm (64 bit and mixing C and asm) here is that code:

You can compile the C and asm together with
gcc test.c foo.s
then run the result with
./a.out

The C function test.c just calls the asm function:

Code:

void foo(unsigned int a, unsigned int b);
int main(int argc, char**argv)
{
   foo(3, 30);
   return 0;
}

The asm function foo.s computes the 64 bit unsigned long of a to the power b, where a and b are 32 bit unsigned values. Then it calls printf to display the result.

Code:

        .section .rodata
message:
        .string "%d ** %d = %ld\n"
        .text
.globl foo
# on entry
#    rdi = base
#    rsi = exponent
foo:
        pushq   %rbp
        movq    %rsp, %rbp
        movl    $1, %ecx       # Same behavior as movq $1,%rcx
        movl    %esi, %edx     # Save Exponent for printf
        testl   %esi, %esi
        jz      2f             # Skip the loop if exponent is zero
1:      imulq   %rdi, %rcx     # rcx *= rdi
        decl    %esi
        jne     1b
2:                             # rcx already has fourth parameter for printf
                               # rdx already has third parameter for printf
        movl    %edi, %esi     # Second parameter for printf
        movq    $message, %rdi # First parameter for printf
        xorl    %eax, %eax     # rax = 0 # Number of SSE registers in parameter list
        call    printf
        leave
        ret

A few details that a beginner would need explained:

1) No parameters nor locals were on the stack. So the conventional pushq %rbp and movq %rsp, %rbp at the beginning balanced by leave at the end, serve little purpose. But we can't simply omit them with no other changes, because the ABI requires the stack to be 16 byte aligned before each call. The stack was 16 byte aligned when main called foo() but then is 8 bytes off of aligned because of the return address pushed. So we need to push an odd number of 8 byte objects before calling printf. The push of rbp at the beginning of each function is usually used as that odd item, so later pushes or stack allocations if any would be an even number of 8 byte items.

But a good asm programmer might notice the opportunity for a "function tail merge". When the last thing you do before returning is call a function, if the stack and register usage are compatible you can jump to the function instead of calling it and then returning. In 32 bit x86, function tail merge is rarely possible unless the signatures of the two functions are nearly identical. But the 64 bit ABI is more powerful, so the function tail merge is easy here even though only two parameters were passed to foo() while four are passed to printf(). So the easier version of foo.s is

Code:

        .section .rodata
message:
        .string "%d ** %d = %ld\n"
        .text
.globl foo
# on entry
#    rdi = base
#    rsi = exponent
foo:
        movl    $1, %ecx       # Same behavior as movq $1,%rcx
        movl    %esi, %edx     # Save Exponent for printf
        testl   %esi, %esi
        jz      2f             # Skip the loop if exponent is zero
1:      imulq   %rdi, %rcx     # rcx *= rdi
        decl    %esi
        jne     1b
2:                             # rcx already has fourth parameter for printf
                               # rdx already has third parameter for printf
        movl    %edi, %esi     # Second parameter for printf
        movl    $message, %edi # First parameter for printf
        xorl    %eax, %eax     # rax = 0 # Number of SSE registers in parameter list
        jmp     printf

2) Any time the destination of an instruction is one of the 32 bit general registers, the CPU will clear the high half of the 64 bit register. So in several places where I wanted a 32 bit unsigned value in a 64 bit register, I just moved or computed the 32 bit value into the 32 bit register (which is a shorter and sometimes faster instruction than moving or computing the 64 bit value) and relied on the upper half being cleared.

3) A good asm programmer plans ahead for register use. The ABI specifies the first six integer or pointer parameters go in registers rdi, rsi, rdx, rcx, r8, and r9. I needed to select a place to save the exponent (because the loop destroys the original exponent) and select a place to compute the result. Since those would be the third and fourth parameters in the call, I selected rdx and rcx.

4) The instruction movl $message, %edi treats the address message as a 32 bit unsigned quantity. But in x86_64, addresses are 64 bit. So this instruction only works right if the program is linked so that all its pre initialized read only data is in first 4GB of the address space. That is a safe assumption for ordinary user mode programs in Linux.

5) When calling any function, such as printf, that takes a variable parameter list, the ABI requires that the caller put in the AL register the number of parameters which have been passed in SSE registers. My function is not passing (nor even using) any SSE registers. So I need to put 0 in al before calling printf. The instruction xorl %eax, %eax clears all of rax and is generally the fastest way to clear part or all of rax. (al is the lowest byte of rax).

6) You should read the as documentation for the use of jump targets such as the 2f and 1b I used in this example. In this example they connect to the 2: and 1: labels.