Why does Scrivener re-load web items?

philipt18
Posts: 26
Joined: Thu Sep 23, 2010 8:10 pm
Platform: Mac

Tue Aug 18, 2015 12:14 pm Post

When you drag a web page into Scrivener, it defaults to saving it as a Web Archive. If that's the case, then why does it need to access the net when viewing that same page in Scrivener later? It seems to be loading images from the original web site instead of using what it downloaded originally. If I had saved it as a Web Archive and then dragged that into Scrivener, presumably it wouldn't do that. My question is why is it doing it when I drag a page into Scrivener?

Thanks.

KB
Site Admin
Posts: 20763
Joined: Tue Jun 13, 2006 11:23 pm
Platform: Mac
Location: Truro, Cornwall

Tue Aug 18, 2015 7:48 pm Post

I'm afraid I don't actually know the answer. Scrivener just uses Apple's WebKit (the same engine Safari uses) for displaying and saving web archives, so the behaviour is determined by WebKit itself. There is certainly nothing in Scrivener's code that does this: all Scrivener does is call a load request on the file URL, and it has no knowledge of the original web page.
"You can't waltz in here, use my toaster, and start spouting universal truths without qualification."

AmberV
Posts: 24307
Joined: Sun Jun 18, 2006 4:30 am
Platform: Mac + Linux
Location: Ourense, Galiza

Tue Aug 18, 2015 7:53 pm Post

Scrivener supports importing and viewing the WebArchive format via the WebKit component of the Mac; whatever that format does or does not do correctly is a problem with the format and the viewer that has been supplied for working with it. I’ve never been a big fan of it for this reason, and because I generally do not want the entire page anyway: all of the data-mining tracking bugs, Flash ads that make WebKit unstable and a security nightmare, and even the navigation components around the content I intend to archive. I want to trim all of that junk out, so for my purposes importing as text is the best solution.

And yes, some legitimate stuff can end up not being archived as well; it depends on how the page was coded. Increasingly, web designers create pages that are all but inoperable by themselves. The browser has to go and fetch data from potentially dozens of different web sites, for everything from CSS to JavaScript to images, just to put it all together into a form that most people will recognise. I believe WebArchive does some encapsulation, but not everything will get imported.
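For anyone who wants to check a particular archive, here is a rough Python sketch. A .webarchive file is a property list, with the page itself stored under WebMainResource and the captured extras under WebSubresources. The function name missing_subresources and the choice of tags to inspect are my own assumptions for illustration. The script only compares the URLs referenced in the saved HTML against the URLs that were embedded, so it is a heuristic: it ignores resources pulled in from CSS or by scripts, and it says nothing certain about what WebKit will actually request.

```python
import plistlib
from html.parser import HTMLParser
from urllib.parse import urljoin


class _ResourceCollector(HTMLParser):
    """Collects URLs from tags that normally make a browser fetch something."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.urls = set()

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag in ("img", "script") and attrs.get("src"):
            self.urls.add(urljoin(self.base_url, attrs["src"]))
        elif tag == "link" and attrs.get("href"):
            # Only stylesheets are counted here; other <link> types are skipped.
            if "stylesheet" in (attrs.get("rel") or "").lower():
                self.urls.add(urljoin(self.base_url, attrs["href"]))


def missing_subresources(archive_bytes):
    """Return referenced URLs that are NOT embedded in the archive (hypothetical helper)."""
    archive = plistlib.loads(archive_bytes)  # handles binary and XML plists
    main = archive["WebMainResource"]
    base_url = main.get("WebResourceURL", "")
    encoding = main.get("WebResourceTextEncodingName") or "utf-8"
    try:
        html = main["WebResourceData"].decode(encoding, errors="replace")
    except LookupError:
        # Unknown encoding name: fall back to UTF-8 rather than fail.
        html = main["WebResourceData"].decode("utf-8", errors="replace")

    embedded = {
        res.get("WebResourceURL") for res in archive.get("WebSubresources", [])
    }
    collector = _ResourceCollector(base_url)
    collector.feed(html)
    return sorted(collector.urls - embedded)


if __name__ == "__main__":
    import sys

    with open(sys.argv[1], "rb") as handle:
        for url in missing_subresources(handle.read()):
            print(url)
```

Run it with the path to a .webarchive file as the only argument. Any URL it prints is referenced by the page but was not stored in the archive, which makes it a plausible candidate for a network request when the archive is viewed.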
.:.
Ioa Petra'ka
“Whole sight, or all the rest is desolation.” —John Fowles

philipt18
Posts: 26
Joined: Thu Sep 23, 2010 8:10 pm
Platform: Mac

Wed Aug 19, 2015 5:39 am Post

Yes, but the whole point of a web archive is that it's off-line. Scrivener for some reason is trying to load elements from the original page online, which is what doesn't make sense.

KB
Site Admin
Posts: 20763
Joined: Tue Jun 13, 2006 11:23 pm
Platform: Mac
Location: Truro, Cornwall

Wed Aug 19, 2015 12:49 pm Post

philipt18 wrote:Yes, but the whole point of a web archive is that it's off-line. Scrivener for some reason is trying to load elements from the original page online, which is what doesn't make sense.


Yes, but as we have both said, that's down to Apple's WebKit and has nothing to do with Scrivener. You can always convert them to text or import the files as PDF instead.

Briar Kit
Posts: 1787
Joined: Thu Apr 04, 2013 9:04 am
Platform: Mac

Wed Aug 19, 2015 4:07 pm Post

As KB and Ioa say, it's an Apple issue.

You can default webarchives to open with TextEdit. Not pretty, but they *shouldn't* update when loaded in the external editor (TextEdit), as they do when loaded in Safari.
Account closed January 2017

nom
Posts: 1921
Joined: Sun Aug 31, 2008 12:02 am
Platform: Mac + iOS
Location: Melbourne, Australia

Thu Aug 20, 2015 11:23 am Post

Briar Kit wrote:As KB and Ioa say, it's an Apple issue.

You can default webarchives to open with TextEdit. Not pretty, but they *shouldn't* update when loaded in the external editor (TextEdit), as they do when loaded in Safari.


Neat trick!
Complete and utter NOMsense.

cem
Posts: 2
Joined: Sat Oct 31, 2015 12:01 am
Platform: Mac

Sat Oct 31, 2015 12:02 am Post

given how useful having an archive of the text and visual information from web pages can be that functions offline, i'd like to request that mac users get some kind of "pdf" import option when they try to pull a web page into research the way pc users do

of course it is possible to export to pdf in advance and import but saving the extra step and keeping the nice formatting scrivener uses would be ideal and a great timesaving feature imo

thanks for a great product!

AmberV
Posts: 24307
Joined: Sun Jun 18, 2006 4:30 am
Platform: Mac + Linux
Location: Ourense, Galiza

Sat Oct 31, 2015 2:21 am Post

I can’t speak to whether doing so directly is possible with the tools available on a Mac, but all Mac software that can print through the standard OS X printing dialogue can create a PDF; it is a native feature of the operating system. Scrivener can be a direct target from any print dialogue, via the “PDF” button. If you don’t see that option, you might need to install it manually. Instructions are provided in §11.4, Print as PDF to Scrivener, pg. 141 of the user manual.

cem
Posts: 2
Joined: Sat Oct 31, 2015 12:01 am
Platform: Mac

Tue Nov 17, 2015 2:31 am Post

i didn't thank you amber, this worked! i needed to use the alias manual install in the manual. very grateful for this!

just fyi for any other chrome users, there is an additional click required to go to 'open system print dialogue'

rdale
Posts: 2037
Joined: Tue Jul 14, 2015 1:07 pm
Platform: Mac, Win + iOS
Location: St. Louis, MO

Tue Nov 17, 2015 10:24 pm Post

I like to combine the ability to "print" a PDF of a web page directly to Scrivener with the "Reader" feature of Safari, which cleans up the fonts and usually includes only relevant images from the web page. Some Evernote extensions available in other web browsers have a similar website view that can often be used to make uncluttered PDFs for importing to Scrivener.
FKA: robertdguthrie
AKA: R Dale Guthrie, Robert, Mr. Obscure, and "Oh, it's you again".

AmberV
Posts: 24307
Joined: Sun Jun 18, 2006 4:30 am
Platform: Mac + Linux
Location: Ourense, Galiza

Tue Nov 17, 2015 10:37 pm Post

A good one for Firefox (which also has a built-in reader mode) is the HackTheWeb extension, which takes its inspiration from the now-defunct Aardvark. When you use that extension, your mouse highlights regions of the page in a red rectangle, and you get a few buttons to do things with those regions. For example, you can select the article portion of a page and hit the i key to “isolate” it, stripping every element outside of that box out of the page. If you use Stylish as well, you can even save your modifications so that the site will go on looking the way you trimmed it.

It’s useful for those occasions where the reader feature doesn’t work right, or where it changes too much of the article’s text formatting.