Data and Text Processing for Health and Life Sciences (Record no. 1017)
000 -LEADER | |
---|---|
fixed length control field | 04525nam a22005775i 4500 |
001 - CONTROL NUMBER | |
control field | 978-3-030-13845-5 |
003 - CONTROL NUMBER IDENTIFIER | |
control field | DE-He213 |
005 - DATE AND TIME OF LATEST TRANSACTION | |
control field | 20210511120753.0 |
007 - PHYSICAL DESCRIPTION FIXED FIELD--GENERAL INFORMATION | |
fixed length control field | cr nn 008mamaa |
008 - FIXED-LENGTH DATA ELEMENTS--GENERAL INFORMATION | |
fixed length control field | 190610s2019 gw | s |||| 0|eng d |
020 ## - INTERNATIONAL STANDARD BOOK NUMBER | |
International Standard Book Number | 9783030138455 |
-- | 978-3-030-13845-5 |
024 7# - OTHER STANDARD IDENTIFIER | |
Standard number or code | 10.1007/978-3-030-13845-5 |
Source of number or code | doi |
050 #4 - LIBRARY OF CONGRESS CALL NUMBER | |
Classification number | RC261-271 |
072 #7 - SUBJECT CATEGORY CODE | |
Subject category code | MJCL |
Source | bicssc |
072 #7 - SUBJECT CATEGORY CODE | |
Subject category code | MED062000 |
Source | bisacsh |
072 #7 - SUBJECT CATEGORY CODE | |
Subject category code | MJCL |
Source | thema |
082 04 - DEWEY DECIMAL CLASSIFICATION NUMBER | |
Classification number | 614.5999 |
Edition number | 23 |
100 1# - MAIN ENTRY--PERSONAL NAME | |
Personal name | Couto, Francisco M. |
Relator term | author. |
Relationship | aut |
-- | http://id.loc.gov/vocabulary/relators/aut |
9 (RLIN) | 4950 |
245 10 - TITLE STATEMENT | |
Title | Data and Text Processing for Health and Life Sciences |
Medium | [electronic resource] / |
Statement of responsibility, etc. | by Francisco M. Couto. |
250 ## - EDITION STATEMENT | |
Edition statement | 1st ed. 2019. |
264 #1 - PRODUCTION, PUBLICATION, DISTRIBUTION, MANUFACTURE, AND COPYRIGHT NOTICE | |
Place of production, publication, distribution, manufacture | Cham : |
Name of producer, publisher, distributor, manufacturer | Springer International Publishing : |
-- | Imprint: Springer, |
Date of production, publication, distribution, manufacture, or copyright notice | 2019. |
300 ## - PHYSICAL DESCRIPTION | |
Extent | XV, 98 p. 483 illus., 74 illus. in color. |
Other physical details | online resource. |
336 ## - CONTENT TYPE | |
Content type term | text |
Content type code | txt |
Source | rdacontent |
337 ## - MEDIA TYPE | |
Media type term | computer |
Media type code | c |
Source | rdamedia |
338 ## - CARRIER TYPE | |
Carrier type term | online resource |
Carrier type code | cr |
Source | rdacarrier |
347 ## - DIGITAL FILE CHARACTERISTICS | |
File type | text file |
Encoding format | |
Source | rda |
490 1# - SERIES STATEMENT | |
Series statement | Advances in Experimental Medicine and Biology, |
International Standard Serial Number | 0065-2598 ; |
Volume/sequential designation | 1137 |
505 0# - FORMATTED CONTENTS NOTE | |
Formatted contents note | Preface -- Introduction -- Resources -- Data Retrieval -- Text Processing -- Semantic processing -- Index. |
506 0# - RESTRICTIONS ON ACCESS NOTE | |
Terms governing access | Open Access |
520 ## - SUMMARY, ETC. | |
Summary, etc. | This open access book is a step-by-step introduction to how shell scripting can help solve many of the data processing tasks that Health and Life specialists face every day, with minimal software dependencies. The examples presented in the book show how simple command line tools can be used and combined to retrieve data and text from web resources, to filter and mine literature, and to explore the semantics encoded in biomedical ontologies. To store data, this book relies on open standard text file formats, such as TSV, CSV, XML, and OWL, that can be opened by any text editor or spreadsheet application. The first two chapters, Introduction and Resources, provide a brief introduction to shell scripting and describe popular data resources in Health and Life Sciences. The third chapter, Data Retrieval, starts by introducing a common data processing task that involves multiple data resources. Then, this chapter explains how to automate each step of that task by introducing the required command line tools one by one. The fourth chapter, Text Processing, shows how to filter and analyze text by using simple string matching techniques and regular expressions. The last chapter, Semantic Processing, shows how XPath queries and shell scripting are able to process complex data, such as the graphs used to specify ontologies. Besides having remained almost unchanged for more than four decades and being available on most personal computers, shell scripting is relatively easy for Health and Life specialists to learn as a sequence of independent commands. Comprehending them is like conducting a new laboratory protocol by testing and understanding its procedural steps and variables, and combining their intermediate results. Thus, this book is particularly relevant to Health and Life specialists or students who want to easily learn how to process data and text, which in turn may help and inspire them to acquire deeper bioinformatics skills in the future. |
650 #0 - SUBJECT ADDED ENTRY--TOPICAL TERM | |
Topical term or geographic name entry element | Cancer research. |
9 (RLIN) | 4951 |
650 #0 - SUBJECT ADDED ENTRY--TOPICAL TERM | |
Topical term or geographic name entry element | Human genetics. |
9 (RLIN) | 4952 |
650 #0 - SUBJECT ADDED ENTRY--TOPICAL TERM | |
Topical term or geographic name entry element | Database management. |
9 (RLIN) | 4953 |
650 #0 - SUBJECT ADDED ENTRY--TOPICAL TERM | |
Topical term or geographic name entry element | Data mining. |
9 (RLIN) | 541 |
650 14 - SUBJECT ADDED ENTRY--TOPICAL TERM | |
Topical term or geographic name entry element | Cancer Research. |
Authority record control number or standard number | https://scigraph.springernature.com/ontologies/product-market-codes/B11001 |
9 (RLIN) | 4954 |
650 24 - SUBJECT ADDED ENTRY--TOPICAL TERM | |
Topical term or geographic name entry element | Human Genetics. |
Authority record control number or standard number | https://scigraph.springernature.com/ontologies/product-market-codes/B12008 |
9 (RLIN) | 4955 |
650 24 - SUBJECT ADDED ENTRY--TOPICAL TERM | |
Topical term or geographic name entry element | Database Management. |
Authority record control number or standard number | https://scigraph.springernature.com/ontologies/product-market-codes/I18024 |
9 (RLIN) | 4956 |
650 24 - SUBJECT ADDED ENTRY--TOPICAL TERM | |
Topical term or geographic name entry element | Data Mining and Knowledge Discovery. |
Authority record control number or standard number | https://scigraph.springernature.com/ontologies/product-market-codes/I18030 |
9 (RLIN) | 546 |
710 2# - ADDED ENTRY--CORPORATE NAME | |
Corporate name or jurisdiction name as entry element | SpringerLink (Online service) |
9 (RLIN) | 141 |
776 08 - ADDITIONAL PHYSICAL FORM ENTRY | |
Relationship information | Printed edition: |
International Standard Book Number | 9783030138448 |
776 08 - ADDITIONAL PHYSICAL FORM ENTRY | |
Relationship information | Printed edition: |
International Standard Book Number | 9783030138462 |
773 ## - HOST ITEM ENTRY | |
Title | Springer Nature Open Access eBook |
830 #0 - SERIES ADDED ENTRY--UNIFORM TITLE | |
Uniform title | Advances in Experimental Medicine and Biology, |
International Standard Serial Number | 0065-2598 ; |
Volume/sequential designation | 1137 |
9 (RLIN) | 3063 |
856 40 - ELECTRONIC LOCATION AND ACCESS | |
Uniform Resource Identifier | https://doi.org/10.1007/978-3-030-13845-5 |
912 ## - | |
-- | ZDB-2-SBL |
912 ## - | |
-- | ZDB-2-SXB |
912 ## - | |
-- | ZDB-2-SOB |
942 ## - ADDED ENTRY ELEMENTS (KOHA) | |
Koha item type | e-Books |
-- | Administrator Library |
No items available.