LinuxQuestions.org
Linux - Newbie This Linux forum is for members that are new to Linux.
Just starting out and have a question? If it is not in the man pages or the how-to's this is the place!

Old 03-31-2009, 12:59 AM   #1
Azazwa
Member
 
Registered: Mar 2009
Distribution: Fedora 8, Ubuntu 9.10, SuSE 10, Fedora 14
Posts: 98

Rep: Reputation: 15
DHCP, OSCAR cluster installation, VMware server and PXE-boot problem.


Hi!

I seem to have a talent for doing things at the wrong time, in the wrong order or just in general messing things up. Sigh...

I am installing a little cluster with OSCAR 5.1rc (which I have to do for a postgrad chemistry project. Yes, the connection between clusters and chemistry isn't immediately apparent, but it's there...)
At the second-to-last step, where the cluster setup should complete, it failed: torque and maui could not finish their configuration. I have a server node, and have added one normal compute node.

Then, perhaps because my supervisor wanted to see some progress, and because I had a few hours in which I couldn't continue with other work, I installed VMware Server 2.0 on the server node, as we have a small partition for XP. (The server runs Fedora 8 on the larger partition.)

Now, when I try to add more nodes to the cluster, they fail to boot over the network. This is what I get on the nodes:

PXE-E11 : ARP timeout
PXE-E11 : ARP timeout
PXE-E38 : TFTP cannot open connection
PXE-M0F : Exiting Intel Boot Agent

So I guess the trouble-maker is the combination of VMware and DHCP. The VMware installer said the following:

Code:
This system appears to have a DHCP server configured for normal use.  Beware 
that you should teach it how not to interfere with VMware Server's DHCP server.
There are two ways to do this:

1) Modify the file /etc/dhcpd.conf to add something like:

subnet 10.0.0.0 netmask 255.0.0.0 {
# Note: No range is given, vmnet-dhcpd will deal with this subnet.
}

2) Start your DHCP server with an explicit list of network interfaces to deal 
with (leaving out vmnet1). e.g.:

dhcpd eth0

Consult the dhcpd(8) and dhcpd.conf(5) manual pages for details.
Perhaps the above makes everything crystal-clear to people who are not as clueless as I am. Firstly, I don't have a dhcpd.conf file in /etc. Now that I have installed VMware, I have this /etc/vmware/vmnet8/dhcpd which looks like this:

Code:
#
# Configuration file for ISC 2.0b6pl1 vmnet-dhcpd operating on vmnet8.
#
# This file was automatically generated by the VMware configuration program.
# If you modify it, it will be backed up the next time you run the
# configuration program.
#
# We set domain-name-servers to make some DHCP clients happy
# (dhclient as configued in SuSE, TurboLinux, etc.).
# We also supply a domain name to make pump (Red Hat 6.x) happy.
#
allow unknown-clients;
default-lease-time 1800;		# 30 minutes
max-lease-time 7200;			# 2 hours

subnet 192.168.0.0 netmask 255.255.0.0 {
    range 192.168.128.0 192.168.255.254;
    option broadcast-address 192.168.255.255;
    option domain-name-servers 192.168.0.2;
    option domain-name "localdomain";
    option routers 192.168.0.2;
}

and similarly for vmnet1. Is there some other dhcpd file that I should change as shown in 1) above?

Secondly, I don't really know how to do 2). Is 2) a one-off thing, or does it have to be done every time the DHCP server starts?

Thirdly, I don't need VMware urgently until the cluster is up and running. Could I uninstall VMware, and just install it again later? Hm, but I'm afraid it might cause problems again. I would prefer to sort out the problems now, and not have some mysterious problems popping up later.

Any advice would be greatly appreciated!
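
On option 2) in the VMware message: running "dhcpd eth0" by hand only lasts until the daemon is next restarted. A minimal sketch of a persistent version follows, assuming the Fedora/Red Hat convention that the dhcpd init script reads its interface list from /etc/sysconfig/dhcpd, and assuming eth0 is the cluster-facing interface (the thread does not confirm either, so check both on the actual server).

```shell
# /etc/sysconfig/dhcpd
# Assumed location: on Fedora/Red Hat the dhcpd init script sources this file.
# Serve DHCP only on the cluster-facing interface (assumed to be eth0),
# leaving vmnet1 and vmnet8 to VMware's own vmnet-dhcpd.
DHCPDARGS=eth0
```

After editing the file, the change would take effect on the next "service dhcpd restart".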
 
Old 03-31-2009, 03:17 AM   #2
Azazwa
Hi! I uninstalled VMware, but now I am getting this:
PXE-E32: TFTP open timeout

My /etc/dhcpd.conf file (which I managed to find in the meantime) looks like this:
Code:
####################################################################
# This dhcpd.conf file was generated by the systeminstaller command
# mkdhcpconf. It reflects the contents of the CLAMDR database.
# File generated at 9:51:26 on 3/31/2009
####################################################################

deny unknown-clients;
option subnet-mask 255.255.0.0;
option broadcast-address 192.168.255.255;
option domain-name "up.ac.za";
option routers 192.168.1.254;
ddns-update-style none; # For dhpcd version 3

# Defined cluster nodes...
subnet 192.168.0.0 netmask 255.255.0.0 {
	group {
		host normnode1{
			hardware ethernet 00:1C:C0:AF:10:01;
			fixed-address 192.168.1.1;
			filename "pxelinux.0";
			option routers 192.168.1.254;
			option domain-name "up.ac.za";
			next-server oscar_server;
		}
		host normnode2{
			hardware ethernet 00:1C:C0:AF:0F:FC;
			fixed-address 192.168.1.2;
			filename "pxelinux.0";
			option routers 192.168.1.254;
			option domain-name "up.ac.za";
			next-server oscar_server;
		}
		host normnode3{
			hardware ethernet 00:1C:C0:AF:0F:CF;
			fixed-address 192.168.1.3;
			filename "pxelinux.0";
			option routers 192.168.1.254;
			option domain-name "up.ac.za";
			next-server oscar_server;
		}
	}
}

# This entry ignores requests on eth1...
subnet 137.215.104.0 netmask 255.255.255.0 {
	not authoritative;
}
Would it make any difference to add

Code:
allow bootp;
allow booting;
before the "# Defined cluster nodes..." comment in the above file?

Any advice would be appreciated!
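
As far as the dhcpd.conf(5) manual page describes them, both "booting" and "bootp" are allowed by default in ISC dhcpd, so adding the two lines should not change behaviour. Whatever is edited in /etc/dhcpd.conf, it is worth checking the syntax before restarting the daemon. The following is only a sketch: the function name check_and_restart_dhcpd is made up for illustration, and it assumes ISC dhcpd (whose -t flag tests the configuration file and -cf names it) and the Fedora service name "dhcpd".

```shell
# Hypothetical helper: validate the config, restart dhcpd only if it parses.
# Run as root. Defaults to /etc/dhcpd.conf, the path used in this thread.
check_and_restart_dhcpd() {
    conf="${1:-/etc/dhcpd.conf}"
    # dhcpd -t parses the file and exits without serving any leases.
    if dhcpd -t -cf "$conf"; then
        service dhcpd restart
    else
        echo "syntax error in $conf; dhcpd not restarted" >&2
        return 1
    fi
}
```

Usage would be "check_and_restart_dhcpd" with no argument, or with the path of another config file to test.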
 
Old 03-31-2009, 06:50 AM   #3
Azazwa
I added the "allow" lines and it didn't make any difference.
I'm rather depressed.
 
Old 03-31-2009, 10:52 AM   #4
Azazwa
Solved the problem

For the PXE-E32: TFTP open timeout problem, disable your firewall properly. That is, don't do it via the GUI. Run "service iptables stop" as root, and check these sites:

http://www.dbapool.com/forumthread/topic_1069.html
http://forums.fedoraforum.org/archiv...p/t-31587.html

Hope this helps anyone who runs into the same error.
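
The fix above can be sketched as follows. The function names are made up for illustration. The first function assumes Fedora's SysV tools (service, chkconfig). The second is an untested, narrower alternative that was not tried in this thread: it leaves iptables running and only accepts DHCP (UDP 67) and TFTP (UDP 69) on the cluster-facing interface, assumed here to be eth0.

```shell
# Stop the firewall now and keep it off after a reboot (run as root).
disable_firewall() {
    service iptables stop
    chkconfig iptables off
}

# Alternative sketch: keep the firewall, accept only what PXE booting needs.
# The interface name is an assumption; pass the real one as the first argument.
open_pxe_ports() {
    iface="${1:-eth0}"
    iptables -I INPUT -i "$iface" -p udp --dport 67 -j ACCEPT   # DHCP/BOOTP server
    iptables -I INPUT -i "$iface" -p udp --dport 69 -j ACCEPT   # TFTP requests
}
```

Rules added with "iptables -I" are lost when the firewall service restarts unless they are saved, for example with "service iptables save".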
 
  

