Archive for the ‘publishing’ category


Testing the search engines: Bing likes antiquity; most favour HTML over PHP

21.09.2022

Bing is spidering new pages, as long as they’re very, very old.

Last week, we added a handful of Lucire pages from 1998 and 1999. An explanation is given here. And I’ve spotted at least two of those among Bing’s results when I do a site:lucire.com search.

As a couple of newer pages have also shown up, I doubt there’s any issue with the template; and the home page now appears, too. But, by and large, Bing is Microsoft’s own Wayback Machine, and most of the Lucire results are from the 1990s and early 2000s.

It got me thinking: do the other search engines do this, too? For years, Google grandfathered older pages and they came up earlier. (Meanwhile, searches for my own name still have this site, and the company site, down, having lost first and second place when we switched from HTTP to HTTPS in March. Contrary to expert opinion, you don’t recover, at least not quickly.)

As Lucire includes the date of the article in the URL, this should be an easy investigation. We’ll only do the first 50 results as that’s all Bing’s capable of. I’ll try not to include any repeat results out of fairness. ‘Contents’ pages’ include the home page, the Lucire TV and Lucire print shopping pages, and tag and category pages.
 
Bing
Contents’ pages ★★★
1997
1998
1999 ★★★★
2000 ★
2001 ★★★★★★★★
2002 ★★
2003 ★★★
2004 ★★★★
2005 ★★
2006
2007 ★★★
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018 ★
2019 ★
2020
2021
2022
 
Google
Contents’ pages ★★★★★★★★★★★★★
1997
1998
1999
2000
2001
2002 ★★
2003
2004 ★★
2005
2006
2007 ★
2008
2009
2010 ★
2011 ★★★
2012 ★
2013 ★★
2014 ★★★
2015 ★
2016 ★★
2017 ★
2018 ★★★
2019 ★★★
2020 ★★★★★★★
2021 ★
2022 ★★★★
 
Mojeek
Contents’ pages ★★★★★★
1997
1998
1999
2000
2001
2002
2003
2004 ★
2005
2006
2007
2008
2009 ★
2010 ★★
2011 ★★
2012 ★★★
2013 ★★★★
2014 ★★★
2015 ★★★★★
2016 ★★★★★★★
2017 ★★★★★★
2018 ★★★
2019 ★★★★
2020 ★★★
2021
2022
 
Baidu
Contents’ pages ★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★
1997
1998
1999
2000
2001
2002
2003
2004
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018 ★
2019 ★
2020
2021 ★★★
2022 ★
 
Yandex
Contents’ pages ★★★★★
1997
1998
1999 ★★★★★
2000 ★★★★★★
2001 ★★★
2002 ★★★
2003 ★★★
2004 ★
2005
2006
2007 ★★★★
2008 ★★
2009 ★★
2010 ★★★★
2011 ★★★
2012 ★★
2013 ★
2014 ★★
2015
2016
2017
2018
2019
2020 ★★★
2021 ★
2022
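
Since the tallies above rely on the date in each URL, the counting could be automated along these lines. This is only a sketch in Python: the URL patterns it looks for (a path segment that is a four-digit year, or an eight-digit date beginning with the year) and the sample URLs are assumptions for illustration, not a confirmed description of how lucire.com structures its addresses.

```python
import re
from collections import Counter
from urllib.parse import urlparse

# Assumed pattern: a path segment that is a year from 1997 to 2022,
# optionally followed by four more digits (month and day).
YEAR_SEGMENT = re.compile(r"^(199[7-9]|20[01]\d|202[0-2])(\d{4})?$")


def year_of(url):
    """Return the year found in the URL path, or None for contents' pages."""
    for segment in urlparse(url).path.split("/"):
        match = YEAR_SEGMENT.match(segment)
        if match:
            return int(match.group(1))
    return None


def tally(urls):
    """Count unique URLs per year; repeats are dropped before counting."""
    counts = Counter()
    for url in dict.fromkeys(urls):  # removes repeats, keeps order
        year = year_of(url)
        counts[year if year is not None else "contents"] += 1
    return counts


def star_chart(counts, first=1997, last=2022):
    """Render the counts in the same one-star-per-result layout."""
    lines = ["Contents' pages " + "★" * counts["contents"]]
    for year in range(first, last + 1):
        lines.append(f"{year} {'★' * counts[year]}".rstrip())
    return "\n".join(lines)


# Hypothetical search results, for illustration only
sample = [
    "https://lucire.com/",
    "https://lucire.com/2002/0520fe0.shtml",
    "https://lucire.com/insider/20220921/a-story/",
    "https://lucire.com/2002/0520fe0.shtml",  # a repeat
]
print(star_chart(tally(sample)))
```

Anything without a recognizable year falls into the contents' pages row, which is a simplification: a real run would need to check that tag and category pages are not mistaken for articles.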
 

To me, that was fascinating. My instincts weren’t wrong with Bing: it’s old and it favours the old (two of the restored articles were indexed). From the first 50 results, 18 results were repeats—that’s 36 per cent. I’m of the mind that Bing is so shot that it can only index old pages that don’t take up much space. New ones have a lot more data to them, generally.

Google does a good job with the top-level and second-level contents’ pages, though there were a few strange tag indices. But the distribution is what you’d expect: people would search for more recent stories. I know we had some popular stories from 2002 that still get hit a lot.

Mojeek has a similar distribution, though it should be noted that you can’t do a blanket site: search. There must be a keyword, and in this case it’s Lucire. The 2016 pages form the mode, which I don’t have a huge problem with; it’s better than the 2001 pages, which Bing has over everything else.

Baidu’s one is crazy as individual stories are seldom spat out in the first five pages, the search engine preferring tag indices, though half a dozen later story pages do make it into its top 50.

Finally, Yandex leans toward older pages, too, including our most popular 2002 piece. It’s the 2000 stories it has the most of among the top 50, and there’s a strange empty period between 2015 and 2019. But at least there is a fairer distribution than Bing can muster.

The other query that I had was whether these search engines were biasing their results toward HTML pages, rather than PHP ones. If that’s the case, then it could explain Bing’s preference for the old stuff (Lucire didn’t have PHP pages till 2008; prior to that it was all laboriously hand-coded, albeit within templates).
 
Bing
★★★★★★★★★★★★★★★★★★★★★★★★★ HTML
★ PHP
 
Google
★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★ HTML
★★★★★★★★★ PHP
 
Mojeek
★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★ HTML
★★★★★★★★★★★★★★★★★ PHP
 
Baidu
★★★★★★★★★★ HTML
★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★ PHP
 
Yandex
★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★ HTML
★★★★★★ PHP
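
A split like the one above could be approximated by looking at each URL's file suffix. The following Python sketch assumes that hand-coded pages end in .html, .htm or .shtml and that everything else (including paths with no suffix at all) is served by PHP; that rule, the function names and the sample URLs are all assumptions for illustration.

```python
from collections import Counter
from pathlib import PurePosixPath
from urllib.parse import urlparse

STATIC_SUFFIXES = {".html", ".htm", ".shtml"}  # assumed hand-coded page types


def page_type(url):
    """Classify a URL as 'HTML' or 'PHP' by the suffix of its path.

    Assumption: any path without a static suffix is generated by PHP.
    """
    suffix = PurePosixPath(urlparse(url).path).suffix.lower()
    return "HTML" if suffix in STATIC_SUFFIXES else "PHP"


def split(urls):
    """Return {kind: (count, percentage)} for the two page types."""
    counts = Counter(page_type(u) for u in urls)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {
        kind: (counts[kind], round(100 * counts[kind] / total, 1))
        for kind in ("HTML", "PHP")
    }


# Hypothetical search results, for illustration only
sample = [
    "https://lucire.com/about.shtml",
    "https://lucire.com/2002/0520fe0.shtml",
    "https://lucire.com/2001/fashion.html",
    "https://lucire.com/insider/20220921/a-story/",
]
print(split(sample))
```

The suffix test is crude: a server can send static HTML from an extensionless path, so a careful count would still need a manual check of the borderline cases.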
 

I think we can safely say there’s a preference for HTML over PHP. Mojeek brings up a lot of HTML pages after the top 50, even though this sample shows the split isn’t as severe.

Our PHP pages are less significant though: they contain news stories, and these are often ones other media covered, too. But I would have thought some of the more popular stories would have made the cut, and here it’s Mojeek’s distribution that looks superior to the others’. It seems like it’s actually analysing the page content’s text, which is what you want a search engine to do.

Baidu’s PHP-heaviness is down to all the tag indices—rendering it not particularly helpful as a search engine.

On these two tests, Mojeek and Google rank best, and Yandex comes in third. Baidu and Bing are a distant fourth and fifth.

Posted in China, culture, internet, media, publishing, technology, UK, USA | No Comments »


The Lucire tribute to HM Queen Elizabeth II

09.09.2022

I wrote the below in Lucire—even though plenty of publications have covered our monarch’s passing, it still felt right to acknowledge it. After all, she had appeared in Lucire a few times.
 
With the passing of HM Queen Elizabeth II on Thursday UK time, it would be remiss of this magazine to not mark this world event.

During the 25 years of Lucire, the Queen has featured several times, mostly from events that she attended. We weren’t around when she was newly crowned in her coronation gown by Norman Hartnell, and wearing the latest British fashions in her youth, a glamorous symbol of a new Elizabethan era that lifted the United Kingdom’s mood after World War II and continued rationing. But it is easy to imagine the coronation in 1953 being a dazzling, colourful event, and indeed it was covered in the likes of British Vogue at the time.

Her era has seen unprecedented change. As the longest-serving monarch in British history, she presided over an era which saw television become mainstream (a technology that she embraced with her Christmas message), many former colonies gain their independence, the dawn of the World Wide Web, the end of apartheid in South Africa, and both her country’s entry into and exit from what is now the European Union.

Much has already been said about HM the Queen’s sense of duty, and how she still read her red box’s worth of papers as head of state right to the end. On Tuesday she asked Liz Truss as the new prime minister—the Queen’s 15th, having begun with Sir Winston Churchill when she ascended to the throne—to form a government.

Here in Lucire the late Queen has attended events we happened to cover, beginning in 2008, with her last appearance at the Cartier Queen’s Cup in 2017.

I only caught a glimpse of her during a state visit to New Zealand in 2002 during her golden jubilee. It was her last visit to Aotearoa.

The visit was very subdued and HM the Queen and HRH the Duke of Edinburgh were whisked from the airport round the back roads of Rongotai, past the main street by Lucire’s then-HQ. I managed to photograph them as they drove by.

A neighbourhood shop had a staff member who was a diehard monarchist. I mentioned I had a photo of the royal couple and later gifted her my print. I still have the negative somewhere.

At the time, my sense was that our Labour government had republican leanings and downplayed the royal visit, hence ferrying them in the viceregal Daimler past industrial areas; it was a far cry from an earlier visit I witnessed in 1981 when as a school pupil, my schoolmates and I lined the drive at Government House to welcome her.

As someone who chose to retain my British nationality (I dutifully renew my passport every 10 years), as well as adopting my New Zealand one in 1980, I admit to having a tremendous amount of respect for HM Queen Elizabeth II and her unwavering sense of duty. Some of us born in Hong Kong in the 1970s, whose parents had memories of less pleasant times behind the Bamboo Curtain, appreciated the freedoms, although they stopped short of democracy, that we enjoyed in a Crown colony. Up to a point: my father said he could have worked harder to lose his Chinese accent after fleeing Taishan for Hong Kong after the communist revolution of 1949, but he chose not to as he didn’t want to be seen as sycophantic to the colonial power.

It was thanks to the Commonwealth that my Hong Kong-born, but China-raised, mother was able to obtain her nursing qualification from the General Nursing Council for England and Wales. When we emigrated to New Zealand, that made her transition into her job that much easier, as it was considered a notch above the rest. (Having said that, the Hospital Board put her on a lower pay grade than what she deserved, leading my parents to fight for it, with the help of Sir Francis Kitts, a family friend and the former mayor of Wellington. We won.)

When we came here, one familiar thing was that the currency had the Queen on it, and it was her constant presence that told you that there were, in principle at least, shared values. While we can rightfully critique the Empire and what it was built on, at least for this chunk of history, it was a reassurance for us as émigrés that there would be the rule of law in our new country, something that, as my parents could attest, China lacked during the difficult years of the war and immediately after.

My father’s preferred form of governance was social democracy, but he appreciated a constitutional monarchy; and my own studies at law school concluded that while an imperfect system, it was one which I, too, valued. The prospect of one of our own being president, at least to the law student me in 1992, seemed unfathomable and potentially divisive.

The success of the system does depend on our faith and trust in the monarch. HM Queen Elizabeth II gave us that sense, as one who placed duty first. As this nation enters into a period of official mourning, we also wonder what her successor, HM King Charles III, will bring to the table, with his interests in the environment and a UK government that he might not see eye to eye with.

Whatever the future, we pay tribute to HM Queen Elizabeth II and mark the close of this second Elizabethan age.

Posted in China, culture, Hong Kong, media, New Zealand, publishing, technology, UK, Wellington | 4 Comments »


Google libels, and gets away with it

08.09.2022

This isn’t a dig at Red Points or Hearst this time, since I received an apology and they did what they said: the DMCA claim notices were withdrawn and they have revised their systems. If anything, Hearst SL wound up quite cordial, as their New York office has tended to be in my dealings with them.

This is a dig at Google, which only today sent what appears to be the final confirmation that our URLs have been reinstated.

This sorry saga began on August 17 and essentially Google told people searching for various terms that we were thieves till today.

The fact this virtual monopoly can libel someone with impunity—and has done so for years—should disturb any right-thinking person.
 
Speaking of Google, we gave in and connected the revised about.shtml page on the Lucire website to a current page. This was a page we hadn’t linked since the 2000s, but kept coming up high on site:lucire.com searches on Google and, formerly, Bing.

Since I typically don’t use Google for searches, and have not done so regularly for a dozen years, I had no idea until investigating the collapse of Bing’s index recently. (It’s still just as compromised, despite claiming it has a higher number of results for any given search. I see no real evidence of this.)

Admittedly, people might seek an ‘About’ page, so instead of their reading a 2004 page, we took the content from our licensing website and created a new one. The old one is linked from there, as it’s quite a novelty.

Posted in business, internet, publishing, technology, USA | No Comments »


Autocade slowly gets to 29 million page views

27.08.2022

It took a while, but Autocade has now finally reached 29 million page views.

The stats’ page since the count was reset shows 1,362,506 views. Add that to the 27,647,011 recorded on March 19, 2022, and we have well and truly crossed the 29 million mark (by 9,517, in fact).
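
For anyone wanting to check the sum, here it is as a few lines of Python, using only the two figures quoted above (the variable names are mine):

```python
views_before_reset = 27_647_011  # recorded on March 19, 2022
views_since_reset = 1_362_506    # current figure on the stats' page
milestone = 29_000_000

total = views_before_reset + views_since_reset
print(f"{total:,}")              # 29,009,517
print(f"{total - milestone:,}")  # 9,517
```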

We probably got there yesterday given that the counter is no longer updated live (so much for improving technology), and I didn’t get a chance to look in.

Sadly, this does mean the slowest growth in reader numbers since 2019.

I’m sure part of it is down to Bing’s collapse, which must have shaved off at least six per cent of our daily totals.

What I have found fascinating is our model leaderboard. The Ford Taunus 80 had been leading for some time since the reset, but it’s been well and truly beaten by the current Toyota Corolla. What caused a sudden surge during August is anyone’s guess, but all I know is that I’m grateful for it. It’s a newish page as well.

The Kia Morning (TA) is now third, another newer entry that shot up the ranks.

I’ve also been watching the pages for the Peugeot 206+ and 207 jostle for fourth place against the Daewoo Winstorm. (At the time of writing, the Winstorm is ahead by five views.) Another former leaderboard champ, the Ford Fiesta Mk VII, now sits in sixth, while the Renault Mégane II, Opel Astra J, Rover SD1 and Ford Cortina Mk III complete the top 10.

Here’s how the Autocade traffic watch is going:
 
March 2008: launch
April 2011: 1,000,000 (three years for first million)
March 2012: 2,000,000 (11 months for second million)
May 2013: 3,000,000 (14 months for third million)
January 2014: 4,000,000 (eight months for fourth million)
September 2014: 5,000,000 (eight months for fifth million)
May 2015: 6,000,000 (eight months for sixth million)
October 2015: 7,000,000 (five months for seventh million)
March 2016: 8,000,000 (five months for eighth million)
August 2016: 9,000,000 (five months for ninth million)
February 2017: 10,000,000 (six months for 10th million)
June 2017: 11,000,000 (four months for 11th million)
January 2018: 12,000,000 (seven months for 12th million)
May 2018: 13,000,000 (four months for 13th million)
September 2018: 14,000,000 (four months for 14th million)
February 2019: 15,000,000 (five months for 15th million)
June 2019: 16,000,000 (four months for 16th million)
October 2019: 17,000,000 (four months for 17th million)
December 2019: 18,000,000 (just under three months for 18th million)
April 2020: 19,000,000 (just over three months for 19th million)
July 2020: 20,000,000 (just over three-and-a-half months for 20th million)
October 2020: 21,000,000 (three months for 21st million)
January 2021: 22,000,000 (three months for 22nd million)
April 2021: 23,000,000 (three months for 23rd million)
June 2021: 24,000,000 (two months for 24th million)
August 2021: 25,000,000 (two months for 25th million)
October 2021: 26,000,000 (two months for 26th million)
January 2022: 27,000,000 (three months for 27th million)
April 2022: 28,000,000 (three months for 28th million)
August 2022: 29,000,000 (four months for 29th million)
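
The gaps in brackets can be reproduced from the month names alone. This is a rough Python sketch that counts whole calendar months between two entries, so it is coarser than wording such as ‘just under three months’; the helper names are made up for the example, and only the last four entries of the list are used.

```python
MONTHS = [
    "January", "February", "March", "April", "May", "June",
    "July", "August", "September", "October", "November", "December",
]


def month_index(label):
    """Turn a label such as 'August 2022' into a running count of months."""
    month, year = label.split()
    return int(year) * 12 + MONTHS.index(month)


def months_between(start, end):
    """Whole calendar months from one milestone to the next."""
    return month_index(end) - month_index(start)


# The last four milestones from the list above
milestones = ["October 2021", "January 2022", "April 2022", "August 2022"]
gaps = [months_between(a, b) for a, b in zip(milestones, milestones[1:])]
print(gaps)  # [3, 3, 4]
```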
 

Toyota’s unsuccessful Verossa was the latest entry into the database.
 

Posted in cars, internet, media, New Zealand, publishing | No Comments »


Bing hates novelty—it’s really Microsoft’s Wayback Machine

27.08.2022

Bing is still very clearly near death, as this latest site: search shows.
 

 

It manages a grand total of 10 pages from Lucire, and as outlined before, some are pages that have not been linked to for 17 years.

I purposely updated some of the pages Bing had in its limited capacity, and strangely, those have disappeared! Bing doesn’t want anything new, as it appears to be Microsoft’s Wayback Machine.

The fifth result here is a case in point. Some of you may recall lucire.com/about.shtml appearing in all the search engines, including Bing. This is a page last updated in 2004, with some final tweaks in 2012 (I assume for ad code; I don’t recall). It was a page that I decided I would stick on to a new template, since the search engines loved it so much. I copied the text from our licensing site. And, for the sake of online archæology, I put the 2004 page exactly as it was into a file called about-2004.shtml.

Bing must still be alive enough to spider and index the renamed page, but it rejects the revised about.shtml!

It’s similar to what I wrote in mid-August when I updated other ancient pages from the early 2000s: Bing rejected them, including a frameset that now pointed at the latest page!

You may be thinking: obviously, you are doing something wrong with your newer code, Jack, for Bing to favour the old stuff. But look at the fourth result: it’s from 2020, the one “new” page that Bing has managed to index and show. I don’t think we have anything wrong with our code if this page has made it in.

Google happily included the new about.shtml.

A search for Lucire itself on Bing now does include the home page, which is a new development in a search engine that’s limping along. So much for the earlier claim that there were issues with the page that prevented it from appearing.

Posted in internet, media, publishing, technology, USA | No Comments »


The Red Points saga: this might finally be resolved

24.08.2022

Nine days since the first DMCA notice was lobbed against us, the saga has finally reached the powers-that-be at Hearst SL.

And once it did, things began happening quickly. I’ve heard from their head of legal, and what he’s outlined to me seems like a good resolution to the whole saga.

He tells me some changes have been made to Red Points Solution SL’s processes, which I think is a good outcome if it saves others the grief of what I’ve had to deal with—especially while contending with publishing deadlines and the day-to-day running of a company. It was a bigger distraction than I would have liked to admit.

In a gesture of goodwill, I offered to set to private the two stories we published on the Lucire website over the whole affair.

I suggested to him that I update everyone here, since you might have thought that the disappearance of the two articles was down to Red Points!

I shudder to think what would have happened if I didn’t have contact email addresses for senior VPs at Hearst Communications, Inc. or former Lucire team members who wound up working for Hearst. Or how someone without a legal background specializing in IP would have felt. Not everyone would be in this position.

It’s still concerning to me that Google continues to state that results have been removed in site searches for us, and for the topics those articles covered. Basically, they’re saying we’re thieves, and I don’t think that’s fair dinkum. As Google works at a glacial pace, I assume the notices will eventually disappear once they receive Red Points’ withdrawals.

I’ve also received an apology from Red Points’ CMO. The gentlemanly thing to do is to accept it. It will be interesting to see how long it takes for Google to stop saying we stole stuff.

Posted in business, internet, media, New Zealand, publishing, technology, USA | No Comments »


Red Points Solution SL walks right into it, attempts to shut down free speech via DMCA

23.08.2022

This is too good. Now, Hearst Communications, Inc. was sensible enough to realize that what I raised was real, and a senior VP put me on to a colleague dealing with Hearst Magazines International. Nothing yet, but I wrote a release, sent it to a few colleagues, and published it on Lucire describing what had happened. As it’s going into Lucire, unlike Google, I’m really careful about libel.

Just now, Red Points Solution SL has been by and issued another notice. They can’t deal with the negative publicity so they play the only card they know how: issuing another DMCA notice to Google and leaving Hearst SL wide open to a penalty of perjury.

I mean, I’ve seen stupid (like that time a former disgruntled staffer wrote an anonymous note to people who knew me but hand-addressed the envelope), but this is like walking into a trap (that I didn’t even realize I had set!).

Now, what if word got out even more widely that Red Points Solution SL is shutting down free speech? Time to send the release more widely?

If only I had more time—but this might be tomorrow’s free-time project.

Posted in internet, media, publishing, technology | No Comments »


Testing the seven search engines in the world

22.08.2022

After reading Mojeek’s blog post from last July, I learned there are only seven search engines in the world now. In other words, I was checking more search engines out in the 1990s. It’s rather depressing, especially as the search market is largely a monopoly with Google dominating it (and all the ills that brings), and Bing and its licensees (like DuckDuckGo) with their 6 per cent.

Knowing there are seven, I fed the site:lucire.com search into all of them to see where each stood.

The first figure is the claimed number of results, the second the actual number shown (with repeats left in, which Bing is guilty of).

I can’t use Brave here as its site search is Bing as well.

Yandex appears to be capped at 250 and Mojeek at 1,000, but at least they aren’t arbitrary like Google and Baidu. Baidu has a lot of category and tag pages from the WordPress section of our site to bump up the numbers.
 
Gigablast 0/0
Sogou 19/13
Bing 243/50
Baidu 13,700/213
Yandex 2,000/250
Google 6,280/315
Mojeek 3,654/1,000
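
To put the two figures side by side, here is a small Python sketch that works out what share of each engine’s claimed results it actually showed. The numbers are the ones listed above; the function and variable names are only for this example. Bear in mind that Yandex and Mojeek appear to be capped, so their shares reflect the cap rather than the size of their indices.

```python
# (claimed, shown) for each engine, as listed above
results = {
    "Gigablast": (0, 0),
    "Sogou": (19, 13),
    "Bing": (243, 50),
    "Baidu": (13_700, 213),
    "Yandex": (2_000, 250),
    "Google": (6_280, 315),
    "Mojeek": (3_654, 1_000),
}


def shown_share(claimed, shown):
    """Percentage of claimed results actually shown; None if nothing is claimed."""
    if claimed == 0:
        return None
    return round(100 * shown / claimed, 1)


for engine, (claimed, shown) in results.items():
    share = shown_share(claimed, shown)
    label = "n/a" if share is None else f"{share}%"
    print(f"{engine}: {shown:,} of {claimed:,} ({label})")
```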
 

Frankly, more of us should go to Mojeek. It can only get better with a wider user base. Unlike Bing, it hasn’t collapsed. I know most of you will keep going to Google, but I just don’t like the look of those limits (not to mention the massive privacy issues).

Mojeek is now at 5,900 million pages, which must be the largest index in the west outside of Google.

Posted in China, internet, publishing, technology, UK, USA | No Comments »


Google finally responds to our first counter-notification

21.08.2022

I suppose it’s positive that Google has finally responded to our first counter-notification against Hearst SL’s and Red Points Solution SL’s fraudulent DMCA notice. Hey, Google, why don’t you begin by asking your complainants for proof before presuming an innocent party guilty? Then use your milliards of dollars and high-tech to see that our work is original? Would have saved us a lot of time.

You’ll soon see the other two counter-notices I filed on the first issue alone while I waited and waited and waited for you to respond. Failing to do that first step has cost us all time. And you knew of this problem back in the second half of the 2010s, if not before.

This system is really broken.
 

 

Oh well, another two weeks of libel by Google on the first issue alone. Everyone: use Mojeek.

Posted in internet, media, publishing, technology, USA | No Comments »


Companies worth millions engaging in fraud, and Google is their weapon

20.08.2022

Yesterday morning, we received a second notice with two more URLs—one with wholly our own content—from Hearst SL and its contractor, Red Points Solution SL.

I’ve done a bit more digging and it’s usually fraudsters who engage in this behaviour. You can read more about them in Techdirt, Mashable and Search Engine Land.

With their millions of dollars, I guess these two Spanish companies are now in the same game of fraud.

And Google believes them, even though Mashable wrote about these techniques in 2018.

If it’s that easy to manipulate Google, then it’s finished as a credible search engine.

Meanwhile, Red Points Solution and Hearst SL open themselves up to charges of perjury. Not too smart there.

Three firms with millions, even milliards, of dollars who don’t like the independents, and one firm now falsely claiming ownership of work from us, French Sole, BFA.com, and L’Oréal. With L’Oréal, why would you involve your own advertiser? Does Hearst SL want to slit its own wrists as a company?

Posted in internet, media, New Zealand, publishing, technology, USA | No Comments »