fbpx

Get 15% off on your first assignment order and best assignment writing service for HND AssignmentsOrder Now

Have Any Question?

UK +4474648-84564

Free Support

hndassignmenthelp@gmail.com

Get Cloud Computing Assignment help by HND Assignment Help

0 Comments

The coursework involves writing a technical report (90%) and developing an application (10%) using Hadoop and MapReduce in the context of a Bioinformatics use case. Listing five AWS modules that you complete from the AWS Academy Cloud Foundations course will contribute 20% as part of the report. You DO NOT need previous Bioinformatics knowledge for this coursework. Input files are provided in plain text and are tab-delimited in Cloud Computing.

Background for the application

A biopharmaceutical company requires a computational analysis platform to compare various bacterial organisms based on their features. Your task is to investigate how cloud resources can be utilised to develop a scalable application to efficiently analyse several organisms as and when required. As the lead developer, you have been given seven example files (see the Appendix). Once developed, the application can be used to compare and analyse thousands of organisms.

Each line in these files includes an identifier for a protein in the second column and information about the protein’s function in the fifth column (Figure 1). A protein’s function is specified via a related Gene Ontology (GO) term. Figure 1 shows example rows for the P0A9Q1 protein. The GO:0003677 term in the “GO ID” column (5th column) indicates the “DNA binding” function. Hence, it can be concluded that the P0A9Q1 protein can bind to DNA.

Get Cloud Computing Assignment help by HND Assignment Help

1.1 Part I – Application (10%)

You will create an application using Hadoop and MapReduce for the following task using the Java programming language. You will use the files provided as input (see the Appendix for these files). You are allowed to use Python 3.

The GO:0030420 term (“establishment of competence for transformation”) and its subterms are used to indicate whether a bacteria can have the competence feature, which is about being transformed to take up exogenous genetic materials. Identify which organisms (using the provided seven input files) may exhibit “competence” as a physiological state. Hint: In the context of this coursework, the existence of the GO:0030420 term, and its subterms, in a file suggests that the corresponding organism may exhibit the competence feature. Therefore, you can count the number of GO assignments for the GO:0030420 term and related subterms for each file.

Cloud Computing
Figure 1. Example rows for the P0A9Q1 protein. The first row includes “GO:0003677” in the 5th column and “UniProtKB” in the 15th column.

1.1  Part II – Technical report (90%)

For part two of this coursework, your task is to produce a report of approximately 2500 words. The report will cover the following topics.

  1. Design a native cloud application for an image web service (50%). This item will include your design and report from the AWS workshop (Workshop 2). You can only refer to AWS resources covered in the AWS Academy Cloud Foundations course.
    1. Describe how IaaS, PaaS, SaaS concepts are related to the application you will develop when using the AWS resources. Explain the IaaS, PaaS and SaaS concepts by giving real-world examples from existing cloud providers and the literature.
    1. Provide an overview diagram of your design.
    1. Describe your design for the image web service and its REST interface for different access and search options.
    1. Describe how you plan to integrate different AWS products for this image web service considering cloud-related characteristics (e.g. storage, scalability and so on).
  2. Describe and evaluate the bioinformatics application (see section 1.1) that you will develop

(10%).

  • Describe your source and output files and folders and how they should be used.
    • Critically evaluate design aspects related to your solution, present and evaluate your findings (see section 1.1).
  • Analysis and discussion for future cloud integration of the bioinformatics application (see section 1.1) (10%). Identify and describe an existing cloud-based and distributed file processing system. Explain how you can integrate the bioinformatics application you will develop (see section 1.1) with this cloud-based system in the future. Discuss the advantages and disadvantages of migrating your application to the cloud.
  • AWS Academy Cloud Foundations course (20%). List five AWS modules that you complete from the online AWS Academy Cloud Foundations course, together with your score for each module. The course is accessible from the link below. Invitation emails were sent at the beginning of the semester. To successfully complete an AWS module, your score must be 80% or more. https://awsacademy.instructure.com/courses/12799

Input files

Table 1. List of bacterial organisms and corresponding input files. The files are available from the KLE assessment page.

 OrganismInput File
1Escherichia coli K-12Escherichia_coli_K-12_ecocyc_83333.gaf
2Bacillus subtilis 168Bacillus_subtilis_168-224308.gaf
3Bacillus amyloliquefaciens FZB42Bacillus_amyloliquefaciens_FZB42-326423.gaf
4Bacillus licheniformis ATCC 14580Bacillus_licheniformis_ATCC_14580-279010.gaf
5Bacillus megaterium DSM 319Bacillus_megaterium_DSM_319-592022.gaf
6Geobacillus kaustophilus HTA426Geobacillus_kaustophilus_HTA426-235909.gaf
7Geobacillus thermodenitrificans NG80Geobacillus_thermodenitrificans_NG80_2-420246.gaf

File format

Data are available in the Gene Annotation File (GAF) format, which defines tab-delimited columns to represent information, as shown below.

Table 2. The flat-file format is comprised of 17 tab-delimited fields. More details can be found at http://geneontology.org/docs/go-annotation-file-gaf-format-2.0.

ColumnContentRequired?CardinalityExample
1DBrequired1UniProtKB
2DB Object IDrequired1P12345
3DB Object Symbolrequired1PHO3
4Qualifieroptional0 or greaterNOT
5GO IDrequired1GO:0003993
6DB:Reference (|DB:Reference)required1 or greaterSGD_REF:S000047763
7Evidence Coderequired1IMP
8With (or) Fromoptional0 or greaterGO:0000346
9Aspectrequired1F
10DB Object Nameoptional0 or 1Toll-like receptor 4
11DB Object Synonym (|Synonym)optional0 or greaterhToll
12DB Object Typerequired1protein
13Taxon(|taxon)required1 or 2taxon:9606
14Daterequired120090118
15Assigned Byrequired1SGD
16Annotation Extensionoptional0 or greaterpart_of(CL:0000576)
17Gene Product Form IDoptional0 or 1UniProtKB:P12345-2

Get Cloud Computing Assignment help by HND Assignment Help

Students can avail of assignment help services for Cloud Computing Assignments with the HND Assignment help team as they offer 100% plagiarism free content, on-time delivery and an affordable price range. Our experts have deep knowledge of the subject and can obtain high marks. Contact us today for assignment help and get a winning edge over fellow students.

Leave a Reply

Your email address will not be published. Required fields are marked *