Feeds: Quality of metadata
I have tested the feeds of different journals in my field.For some of them, the main metadata displayed in the columns in the centre pane is extracted nicely (Title, Creator, Date, Publication, ...).But in most cases, a large part of the metadata extracted in not correctly attributed to the different fields, usually going all together to the field "Abstract".In the worst case, it could only get the title correctly.
188BET靠谱吗Is there a way to improve the metadata extraction for each journal feed, as with the Zotero Connector?
Or is the problem coming from the publisher of the feed?In that case, what is the best way to report to them the problem?188BET靠谱吗Is there any standard that they should follow so that it works nicely with Zotero?
188BET靠谱吗Is there a way to improve the metadata extraction for each journal feed, as with the Zotero Connector?
Or is the problem coming from the publisher of the feed?In that case, what is the best way to report to them the problem?188BET靠谱吗Is there any standard that they should follow so that it works nicely with Zotero?
Then the problem is from the publishers.188BET靠谱吗It seems that they format the data themselves to look ok in some feed readers, without filling correctly the feed metadata that it used by Zotero.
I will contact some publishers to see if they can fix their feeds.
188BET靠谱吗I can probably use the feed of Journal of Fluid Mechanics as a good example to follow (except the inlineFormula that cannot be displayed in Zotero) to explain what is needed:
https://www.cambridge.org/core/rss/product/id/1F51BCFAA50101CAF5CB9A20F8DEA3E4
Elsevier seems to use a random formatting to put all the metadata in the
description
element, without filling the standard metadata elements.For example, the RSS feed of the Journal of Computational Physics: https://rss.sciencedirect.com/publication/science/00219991
For this AIP journal Physics of Fluids, they also use their own formatting for the information put in
description
:https://aip.scitation.org/action/showFeed?type=etoc&feed=rss&jc=phf
For the Springer journal Experiments in Fluids, they do not mention the authors at all.
https://link.springer.com/search.rss?facet-content-type=Article&facet-journal-id=348&channel-name=Experiments in Fluids
188BET靠谱吗Items inside the RSS feed "collections" are not yet fully accepted / integrated into your true Zotero library collections.188BET靠谱吗Zotero feeds are great for capturing newly published articles.188BET靠谱吗However, Zotero recognizes that seldom will a user want to always add to their library everything in every journal issue.188BET靠谱吗Zotero lets you select only the items you need and allows you to place the items in the collection you choose.
https://www.crossref.org/wp/labs/whitepapers/rss-best-practice/
But this is quite old, and the link to the PRISM Module is not working anymore:
http://www.prismstandard.org/resources/mod_prism.html
Is there anything more recent on the standard way to implement RSS Feeds for scientific journals?
I could find also this page, but also dead now:
https://idealliance.org/specifications/prism-metadata/
The Elsevier support directed me to this page, saying that their RSS Feeds work perfectly well in these RSS readers:
https://service.elsevier.com/app/answers/detail/a_id/10818/supporthub/sciencedirect/kw/RSS
188BET靠谱吗They obviously do not want to support Zotero, but I would like to have the correct information on the standards to argue that they should fix their RSS Feeds.
Has anyone tried to contact Elsevier to ask them to provide proper RSS Feeds?
Most of my attempts to contact different publishers to fix their RSS Feeds have failed so far...
description
field, but it's just a block of text that they're dumping there.188BET靠谱吗Zotero displays it as well, in Abstract.The main thing we can do is preserve formatting and newlines when displayingdescription
, since right now we're just stripping the HTML tags.We'll look into doing that.But that won't help with the center columns, of course.We could add some custom hard-coded rules to try to parse known lines out ofdescription
from specific publishers, but that would be kind of ridiculous — this is a format with predefined fields, and the publishers should use them.188BET靠谱吗(Again, though, this doesn't affect what actually gets saved to Zotero.)
For example, the following item:
"Deep Reinforcement Learning-Augmented Spalart–Allmaras Turbulence Model: Application to a Turbulent Round Jet Flow"
Contains the fields:
- prism:volume
- prism:number
- prism:startingPage
188BET靠谱吗However, they are not displayed in Zotero:
188BET靠谱吗https://s3.amazonaws.com/zotero.org/images/forums/u265723/0oz7rvazyw3cdbbnwlvv.png
188BET靠谱吗Is there a problem in the RSS Feed, or are these fields not supported in Zotero?
And just to note here, we added support for rendering HTML in abstracts last month.
I can now see the Volume, Number and Page:
188BET靠谱吗https://s3.amazonaws.com/zotero.org/images/forums/u265723/yy53j1d0o09sjpzqrl30.png
But my feeds from the American Physical Society got broken on the way: they cannot be refreshed anymore:
188BET靠谱吗https://s3.amazonaws.com/zotero.org/images/forums/u265723/irqm3vivu1wti0ntnrjw.png
[JavaScript Error: "Error processing feed fromhttp://feeds.aps.org/rss/recent/prfluids.xml:
TypeError: feedText.createDocumentFragment is not a function"]
[JavaScript Error: "Error processing feed fromhttp://feeds.aps.org/rss/tocsec/PRE-Fluiddynamics.xml:
TypeError: feedText.createDocumentFragment is not a function"]
[JavaScript Error: "Error processing feed fromhttp://feeds.aps.org/rss/tocsec/PRL-NonlinearDynamicsFluidDynamicsClassicalOpticsetc.xml:
TypeError: feedText.createDocumentFragment is not a function"]
They still look fine in Feedly.
And the other Feeds can be refreshed.
Debug ID D357392977
188BET靠谱吗Zotero 7.0.0-beta.81+721f54fe4 (64-bit)
Windows 10
188BET靠谱吗https://s3.amazonaws.com/zotero.org/images/forums/u265723/84qw7effhkx1fojvm38n.png