Supporting data for "A draft genome sequence of the elusive giant squid, Architeuthis dux"

  1. Rute, Da Fonseca R.
  2. Alvarina, Couto
  3. André, Machado M.
  4. Brona, Brejova
  5. Caroline, Albertin B.
  6. Filipe, Silva
  7. Paul, Gardner
  8. Tobias, Baril
  9. Alex, Hayward
  10. Alexandre, Campos
  11. Angela, Ribeiro M.
  12. Inigo, Barrio-Hernandez
  13. Henk-Jan, Hoving
  14. Ricardo, Tafur-Jimenez
  15. Chong, Chu
  16. Bárbara, Frazão
  17. Bent, Petersen
  18. Fernando, Penaloza
  19. Francesco, Musacchia
  20. Graham, Alexander Jr. C.
  21. Hugo, Osório
  22. Inger, Winkelmann
  23. Oleg, Simakov
  24. Simon, Rasmussen
  25. M. Ziaur, Rahman
  26. Davide, Pisani
  27. Erich, Jarvis D.
  28. Guojie, Zhang
  29. Jakob, Vinther
  30. Jan, Strugnell M.
  31. L. Filipe, Castro C.
  32. Olivier, Fedrigo
  33. Mateus, Patricio
  34. Qiye, Li
  35. Sara, Rocha
  36. Agostinho, Antunes
  37. Yufeng, Wu
  38. Bin, Ma
  39. Remo, Sanges
  40. Tomas, Vinar
  41. Blagoy, Blagoev
  42. Thomas, Sicheritz-Ponten
  43. Rasmus, Nielsen
  44. M. Thomas, Gilbert P.

Publisher: GigaScience Database

Year of publication: 2020

Type: Dataset

CC0 1.0

Abstract

The giant squid (Architeuthis dux; Steenstrup, 1857) is an enigmatic giant mollusk with a circumglobal distribution in the deep ocean, except in high Arctic and Antarctic waters. The elusiveness of the species makes it difficult to study, so an assembled genome for this deep-sea species will help address several pending evolutionary questions. We present a draft genome assembly built from 200 Gb of Illumina reads, 4 Gb of Moleculo synthetic long reads and 108 Gb of Chicago libraries, with a final size matching the estimated genome size of 2.7 Gb and a scaffold N50 of 4.8 Mb. We also present an alternative assembly that includes 27 Gb of raw reads generated on the Pacific Biosciences platform. In addition, to assist genome annotation, we sequenced the proteome of the same individual, as well as RNA from three different tissue types from three other squid species (Onychoteuthis banksii, Dosidicus gigas, and Sthenoteuthis oualaniensis). We annotated 33,406 protein-coding genes supported by evidence, and genome completeness as estimated by BUSCO reached 92%. Repetitive regions cover 49.17% of the genome. This annotated draft genome of A. dux provides a critical resource for investigating the unique traits of this species, including its gigantism and key adaptations to deep-sea environments.