Linux - GeneralThis Linux forum is for general Linux questions and discussion.
If it is Linux Related and doesn't seem to fit in any other forum then this is the place.
Notices
Welcome to LinuxQuestions.org, a friendly and active Linux Community.
You are currently viewing LQ as a guest. By joining our community you will have the ability to post topics, receive our newsletter, use the advanced search, subscribe to threads and access many other special features. Registration is quick, simple and absolutely free. Join our community today!
Note that registered members see fewer ads, and ContentLink is completely disabled once you log in.
If you have any problems with the registration process or your account login, please contact us. If you need to reset your password, click here.
Having a problem logging in? Please visit this page to clear all LQ-related cookies.
Get a virtual cloud desktop with the Linux distro that you want in less than five minutes with Shells! With over 10 pre-installed distros to choose from, the worry-free installation life is here! Whether you are a digital nomad or just looking for flexibility, Shells can put your Linux machine on the device that you want to use.
Exclusive for LQ members, get up to 45% off per month. Click here for more info.
Recently, I started downloading data from a website that shows houses for sale. My goal is to create a website on which people can find out about the pricing over the course of time and the length of time a house remains available.
I have created a MySQL database, which is currently being fed each night. At this point, I have enough data records to start analyzing and mining this data.
Now I have come to the part of starting a datawarehouse. I love to use a data modelling tool, like the one from MySQL. Yet, I have a hard time finding some documentation over data modelling a data vault datawarehouse using open source tools.
Pentaho looks promising, but I need to keep this low-budget. So low-budget it needs to be free. MySQL Workbench has a moddelling tool as well. Yet it does not support a data vault scheme.
Has anyone any experience in setting up a datawarehouse? Or in particular a data vault datawarehouse using open source tools?
If you have, could you please point me out to some pitfalls and other best-practices?
Yours sincerely,
Mark
P.S. Couldn't really find a forum where to post this. So I hope I put this in the right forum. Correct me if I'm wrong.
Our product called Quipu generates a data vault datamodel for you datawarehouse. Quipu also generates the required DDL statements and loadcode to bring your source data into the datavault.
Cannot post a url here, so you need to do a search on google on "quipu open source datavault datawarehousemanagement"
Good luck!
Just noticed your registration on our site indeed! Someren and Oirschot are close, it's a small world...
FYI: Quipu backend runs on Ubuntu on JVM. Connects perfectly to MySQL db (using JDBC), for both source and target (datawarehouse) db. Also, Quipu repository can be set up in MySQL.
We have a customer integrating Pentaho Kettle with Quipu, linking to our repository where Quipu generated code is being scheduled and executed using Kettle.
At the moment, I am working with Informatica Powercenter. Just ETL, no reporting yet. I have only had some courses in reporting (Business Objects, WebFOCUS, and such). Would love to try some reporting using your product to set up the datawarehouse.
LinuxQuestions.org is looking for people interested in writing
Editorials, Articles, Reviews, and more. If you'd like to contribute
content, let us know.