Titulo Estágio
Security and performance in Cloud Data Warehouses
Áreas de especialidade
Engenharia de Software
Local do Estágio
DEI-FCTUC
Enquadramento
We pretend to answer the following question: “Is the cloud the right place for data warehousing?” Recently Amazon has launched its newest service, Redshift, a cloud-based data warehouse tool.
Cloud Data Warehousing is a potential cost savings for big companies, and removing a cost barrier that have held data warehousing back from small and mid-sized businesses. Cloud Data Warehouse must be designed to take away the undifferentiated heavy lifting of running infrastructure at heavy scale, and this allows the customers to focus on their core competencies.
However there are the new players in this area: EMC and VMware made somewhat of a splash recently when the companies announced their Pivotal Initiative, a combination of big data and cloud-based technologies from each of the companies. Google, with its BigQuery service, is another player to watch in this space and Kognitio, a European data management and BI platform, has made some rumblings about cloud-based data warehousing and others.
Objetivo
In this project we pretend to analyze all the aspects of Cloud Data Warehousing, putting the stress on the integration of a Cloud DW solution within organizations. Also and more important, the opportunity of using a Cloud DW solution is analyzed in contrast with that of using a traditional DW solution. An important point is to evaluate the various players in the market with special focus on security and performance issues. At the end, a prototype implementation of a Cloud Data Warehouse for a specific environment should be proposed.
Plano de Trabalhos - Semestre 1
[Some tasks might overlap; M=Month]
T1 (M1 – M3): State of the art literature review on Cloud Data Warehousing.
T2 (M3) Design of an architecture model, using the information gathered in task T1 as basis.
T3 (M3) Identification of target systems to be used in the experiments.
T4 (M3 – M4) Implementation of a proof of concept prototype.
T5 (M5): Writing the Intermediate report.
Plano de Trabalhos - Semestre 2
[Some tasks might overlap; M=Month]
T6 (M6): Integration of the intermediate defense comments and completion of the architecture model.
T7 (M6 – M7): Implementation of a prototype, and execution of tests (functional).
T8 (M8): Execution of experiments and analysis of results.
T9 (M9): Write a research paper and submission to a top international conference on the area (IEEE International Conference on Cloud Computing, Database Systems for Advanced Applications - DASFAA, IEEE International Conference on Data Engineering – ICDE, etc.).
T10 (M10): Writing the thesis.
Condições
The work will be carried out in the facilities of the Department of Informatics Engineering at the University of Coimbra (CISUC - Software and Systems Engineering Group), where a work place and necessary computer resources will be provided.
Observações
A scholarship may be available (value to be defined) for at least part of the duration of the internship.
Orientador
Jorge Bernardino
jorge@isec.pt 📩