LinuxQuestions.org
Latest LQ Deal: Latest LQ Deals
Home Forums Tutorials Articles Register
Go Back   LinuxQuestions.org > Forums > Linux Forums > Linux - Server
User Name
Password
Linux - Server This forum is for the discussion of Linux Software used in a server related context.

Notices


Reply
  Search this Thread
Old 06-29-2010, 05:34 PM   #1
antoniemail
LQ Newbie
 
Registered: Jun 2010
Posts: 8

Rep: Reputation: 0
clustering?


Hello,

I have 2 servers right now that use rsync to replicate files from one to the other. I am looking into clustering them now, for HA and redundancy. But does clustering mean there will be replication as well? Or will I still need to keep the rsync going?

I am running fedora servers right now. Any help would be appreciated. Thanks.
 
Old 06-30-2010, 02:33 AM   #2
Blue_Ice
Member
 
Registered: Jul 2006
Location: Belgium
Distribution: Debian, Fedora, CentOS, Windows
Posts: 361

Rep: Reputation: Disabled
That will depend on your cluster setup. If it is really only HA you are looking for, then you would only need heartbeat and drbd for example. In case you want a HPC, then you should also think about a director and cluster filesystem (like OCFS2, GFS, glusterfs, etc.).

But other configurations might be required, depending on the applications you run. Reading your post, you are probably talking about webservers and in that case the above solution will do the trick. When it comes to e.g. MySQL, you probably want to use replication functionality in MySQL itself in combination with heartbeat.
 
Old 06-30-2010, 04:04 AM   #3
czezz
Member
 
Registered: Nov 2004
Distribution: Slackware/Solaris
Posts: 924

Rep: Reputation: 43
Hi, I will attach my question to this topic:
let's say I have 2 identical machine. Can I create/make a cluster which mainly increase CPU performance ?
(I would use RHEL 5.x distro if this any matter)
 
Old 06-30-2010, 04:18 AM   #4
Blue_Ice
Member
 
Registered: Jul 2006
Location: Belgium
Distribution: Debian, Fedora, CentOS, Windows
Posts: 361

Rep: Reputation: Disabled
Quote:
Originally Posted by czezz View Post
Hi, I will attach my question to this topic:
let's say I have 2 identical machine. Can I create/make a cluster which mainly increase CPU performance ?
(I would use RHEL 5.x distro if this any matter)
What do you mean with increase CPU performance in this context? When you are talking about HPC (load balancing), you probably get what you want.
HPC devides the the load over multiple nodes in the cluster. The resources on the nodes will be used less, which causes an increase in the performance. Normally you build a HA cluster for the directors and behind those you have the cluster with the nodes that do the actual work. But it is also possible to put the director on the nodes itself.
 
Old 06-30-2010, 04:40 AM   #5
czezz
Member
 
Registered: Nov 2004
Distribution: Slackware/Solaris
Posts: 924

Rep: Reputation: 43
Well, it sounds like HPC (load balancing) is this what I am thinking about.
Current machine does not manage sometimes with tasks and this is related to CPU usage. I am thinking to convince my boss to buy another identical machine and make a cluster (HPC).

I am thinking now about test on small regular PC:
- generate task which load 100% of CPU on a single PC.
- make cluster HPC on 2x regular PC's and generate this same task as on a single machine - to see what is performance/time of executing.

Is that make sense ?
 
Old 06-30-2010, 05:25 AM   #6
Blue_Ice
Member
 
Registered: Jul 2006
Location: Belgium
Distribution: Debian, Fedora, CentOS, Windows
Posts: 361

Rep: Reputation: Disabled
Quote:
Originally Posted by czezz View Post
Well, it sounds like HPC (load balancing) is this what I am thinking about.
Current machine does not manage sometimes with tasks and this is related to CPU usage. I am thinking to convince my boss to buy another identical machine and make a cluster (HPC).

I am thinking now about test on small regular PC:
- generate task which load 100% of CPU on a single PC.
- make cluster HPC on 2x regular PC's and generate this same task as on a single machine - to see what is performance/time of executing.

Is that make sense ?
Well, not really... Load balancing is only usefull, when e.g. 100 clients connect to one server and max out your resources. But when you have 2 servers in load balancing configuration, the 100 clients will be devided over the 2 servers. This will make sure that your resources will not experience such a heavy load anymore. What you are looking for is, deviding processes over several servers (grid computing). Unfortunately, I have no experience with grids. I think your software should be designed for it as well. An example of grid computing is the SETI project to find extraterrestrial life...
 
Old 06-30-2010, 05:27 AM   #7
Blue_Ice
Member
 
Registered: Jul 2006
Location: Belgium
Distribution: Debian, Fedora, CentOS, Windows
Posts: 361

Rep: Reputation: Disabled
By the way if one process takes all the CPU time, you will definately have an issue with the hardware that is not suitable for the task. One process cannot be split up and handled by several computers.
 
Old 06-30-2010, 07:43 AM   #8
antoniemail
LQ Newbie
 
Registered: Jun 2010
Posts: 8

Original Poster
Rep: Reputation: 0
Thanks Blue Ice. I do have mysql replication right now on my webservers. So that with rsync, heartbeat and drbd I should be all set with a pretty good cluster? Will I need a fence or something to monitor the cluster or will the machines monitor each other?
 
Old 06-30-2010, 08:08 AM   #9
Blue_Ice
Member
 
Registered: Jul 2006
Location: Belgium
Distribution: Debian, Fedora, CentOS, Windows
Posts: 361

Rep: Reputation: Disabled
Quote:
Originally Posted by antoniemail View Post
Thanks Blue Ice. I do have mysql replication right now on my webservers. So that with rsync, heartbeat and drbd I should be all set with a pretty good cluster? Will I need a fence or something to monitor the cluster or will the machines monitor each other?
Heartbeat checks if the service on one server is running and if not it will start the service on the other server.
Drbd is used for realtime replication of the hdd's. Drbd in combination with a cluster file system will give you the ability to build a master/master active/active cluster. This means that if you make a change on one system, it will be automatically replicated to the other server. This is a great solution for fileservers and for the files of websites. Again this is not wise for applications like MySQL and OpenLDAP as they have their own systems to replicate data.
For load balancing you need a director (e.g.: ldirectord), this is used to devide the load between the configured servers.
When set up correctly, you can easily add more servers for better availability and performance.

It is always good to monitor your servers. After all you want to know when one of your nodes goes down. This is not very complicated as all nodes will have their own ip addresses. Soa monitoring system like nagios will do the trick.
Take good care that you set up your nodes in the same way. Things like different firewall settings on nodes can make you wonder why your cluster is not working properly.
 
Old 06-30-2010, 10:57 AM   #10
uten
LQ Newbie
 
Registered: Aug 2006
Distribution: debian, w2k
Posts: 11

Rep: Reputation: 1
Smile

I danger of pointing out the obvious:
To read up on the topic I would recommend cluster_(computing) and search for mosix, openmosix and linuxpmi at wikipedia. I would also recomend to experiment a bit with some of the live cd's specifically designed for clustering tasks to get an idea of what kind of design would benefit you most. And for the rest of us with just a general interest I think distcc, ccache and some monitoring tool is a nice introduction to the benefits and limitations of clustering.

 
Old 07-01-2010, 02:55 AM   #11
Blue_Ice
Member
 
Registered: Jul 2006
Location: Belgium
Distribution: Debian, Fedora, CentOS, Windows
Posts: 361

Rep: Reputation: Disabled
Quote:
Originally Posted by uten View Post
I danger of pointing out the obvious:
To read up on the topic I would recommend cluster_(computing) and search for mosix, openmosix and linuxpmi at wikipedia. I would also recomend to experiment a bit with some of the live cd's specifically designed for clustering tasks to get an idea of what kind of design would benefit you most. And for the rest of us with just a general interest I think distcc, ccache and some monitoring tool is a nice introduction to the benefits and limitations of clustering.

I wouldn't recommend OpenMosix as the project is closed since March 1, 2008. For mosix you have to pay, so that might be out of the question in some cases. I never heard of linuxpmi, so I am going to check that out.
 
  


Reply



Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off



Similar Threads
Thread Thread Starter Forum Replies Last Post
Help on Clustering??? your_shadow03 Linux - Server 5 07-09-2008 12:16 AM
clustering >>? Ammad Linux - General 6 03-20-2008 09:28 PM
Clustering Atwin Fedora 1 04-04-2007 10:57 PM
Clustering anand_rhce Linux - Hardware 0 12-15-2004 06:01 AM
Clustering Stephanie General 5 01-29-2004 12:04 AM

LinuxQuestions.org > Forums > Linux Forums > Linux - Server

All times are GMT -5. The time now is 05:00 AM.

Main Menu
Advertisement
My LQ
Write for LQ
LinuxQuestions.org is looking for people interested in writing Editorials, Articles, Reviews, and more. If you'd like to contribute content, let us know.
Main Menu
Syndicate
RSS1  Latest Threads
RSS1  LQ News
Twitter: @linuxquestions
Open Source Consulting | Domain Registration