Posts
 
Reputation
Joined
Last Seen
0 Reputation Points
Unknown Quality Score

No one has voted on any posts yet. Votes from other community members are used to determine a member's reputation amongst their peers.

0 Endorsements
~2K People Reached
Favorite Tags
Member Avatar for smandape

I am trying to do some web scraping on [this](http://www.ncbi.nlm.nih.gov/pmc/pmctopmid/) site. So far I have tried the following but the output it gives me is the source code of the site which is different than I expect. I am trying to follow [this](http://nsaunders.wordpress.com/2013/09/17/web-scraping-using-mechanize-pmid-to-pmcidnihmsid/#more-3520) link which has done webscraping in ruby. …

0
166
Member Avatar for smandape

Thank you for looking at my question. I am trying to solve this homework question. Consider the problem of sequencing genome by random reads. If G is the length of the entire sequence, L is the length of the read and n is the number of reads, then coverage is …

0
98
Member Avatar for smandape

Hello Experts, you have been of great help to me when it comes to XSLT. Here is another problem I have while I try to extract the data from RDF/XML files. I don't know how to do that as there are terms like dcterms defined in the XML file. They …

Member Avatar for smandape
0
348
Member Avatar for smandape

Hello Experts, Kinda new to perl. I am using perl web API to get the data. The error is "application/xml; charset=UTF-8". I used 'use utf8' but doesn't seem to work. The line it gets stuck looks like something like this [CODE] my @candidates = $c->bookmarks_for(start => 1, tag =>'pubmed'); [/CODE] …

Member Avatar for d5e5
0
175
Member Avatar for smandape

Hello Experts, I am trying to extract data from XML file which is as follows [CODE] <lineage> <taxon scientificName="Homo" taxId="9605" rank="genus"/> <taxon scientificName="Homininae" taxId="207598" rank="subfamily" hidden="true"/> <taxon scientificName="Hominidae" taxId="9604" rank="family"/> <taxon scientificName="Hominoidea" taxId="314295" rank="superfamily" hidden="true"/> <taxon scientificName="Catarrhini" taxId="9526" rank="parvorder"/> <taxon scientificName="Simiiformes" taxId="314293" rank="infraorder" hidden="true"/> <taxon scientificName="Haplorrhini" taxId="376913" rank="suborder"/> <taxon scientificName="Primates" …

Member Avatar for smandape
0
183
Member Avatar for smandape

Hello experts, I am trying to extract the abstract from the following XML code that appears as follow; [CODE] <abstract> <p> In eukaryotes, glutathione S-transferases (GSTs) participate in the detoxification of reactive electrophillic compounds by catalysing their conjugation to glutathione. GST is found as a domain in S-crystallins from squid, …

Member Avatar for iceandrews
0
110
Member Avatar for smandape

Hello experts, I am trying to extract the abstract from the following XML code that appears as follow; [CODE] <abstract> <p> In eukaryotes, glutathione S-transferases (GSTs) participate in the detoxification of reactive electrophillic compounds by catalysing their conjugation to glutathione. GST is found as a domain in S-crystallins from squid, …

Member Avatar for smandape
0
130
Member Avatar for smandape

Hello experts, I am trying to extract data from XML file using XSLT. I am trying to code a general XSLT code that can handle similar XML files that may differ a bit from each other. the XML code I am working on can have the following 4 scenarios for …

Member Avatar for xml_looser
0
265
Member Avatar for smandape

Hello Seniors, I am trying to extract an element from a string using the delimiter ';' in case it is in the middle of the string or nothing if it is the only element in the string..i am coding a generalized code that can extract the element from any such …

Member Avatar for smandape
0
112
Member Avatar for smandape

Hello experts, I am working on the following XML code [CODE]<GBSeq_source-db>UniProtKB: locus NR1I3_HUMAN, accession Q14994; class: standard. extra accessions:Q5VTW5,Q5VTW6 created: Jul 15, 1999. sequence updated: Jun 21, 2004. annotation updated: Feb 8, 2011. xrefs: Z30425.1, CAA83016.1, AL590714.27, CAH72153.1, CAH72154.1, CH471121.2, EAW52608.1, EAW52609.1, BC069626.1, AAH69626.1, A56197, NP_001070948.1, NP_005113.1, 1XV9_B, 1XV9_D, 1XVP_B, …

Member Avatar for smandape
0
144
Member Avatar for smandape

Here is the XML code i am trying to work on... [CODE]<GBSeq_definition>RecName: Full=Solute carrier family 2, facilitated glucose transporter member 1; AltName: Full=Glucose transporter type 1, erythrocyte/brain; Short=GLUT-1; AltName: Full=HepG2 glucose transporter</GBSeq_definition>[/CODE] I want to extract the elements such that I get the output as [CODE]<aliases> Glucose transporter type 1, …

Member Avatar for smandape
0
225