Posts tagged ‘Bing’


Bing increases Techdirt’s results, saving it some embarrassment

13.01.2023

After notifying Mike Masnick, the founder of Techdirt, about my findings about Bing, coincidentally, the search engine began spidering his latest articles. It claimed to have 150 results, and delivered 92, many of which were repeated from page to page as usual. Tonight it’s a claimed 249, delivering 173.

Techdirt is well respected and very popular, and disliked presently only by the Musk bros. What’s the likelihood that Microsoft knew about their shortcomings here and corrected things? I wasn’t exactly quiet, and I told more than Mike and the readers of this blog (I went on Reddit, for example), since it was so ridiculous that Bing could only deliver one result for such a major website. It’s embarrassing for them, so they decided to do the right thing. Like any Big Tech firm: do nothing unless you risk getting bad press. This is right out of the Facebook playbook, for example.

What a pity they could not do the right thing for the rest of us.

Just as a comparison, since I am nothing if not fair. Here are the claimed number of results versus the number delivered for site:techdirt.com:
 
Mojeek: 48,606/1,000
Google: 54,700/394
Bing: 249/173
Yandex: 2,000/250
Baidu: —/1
Gigablast: 0/0
Yep: —/10
 

In that context, it doesn’t look so bad, especially as a lot of Yandex results are of Techdirt’s various directories and largely useless.

It’s not so hot for site:lucire.com over at Bing:
 
Mojeek: 3,481/1,000
Google: 5,970/307
Bing: 2/10
Yandex: 2,000/250
Baidu: 1,480/400
Gigablast: 0/0
Yep: —/10
 

I’m not kidding: Bing claims it had 2 results and delivered 10. Looks like one of those rare times they underestimated. Well off the mark of the 55 they have been doing since mid-2022 and that was pathetic. There is nothing in the results from after 2007. Maybe fixing Techdirt’s results meant that Bing had so little computing power for every other site!

Well, I guess I can no longer claim that for a site:lucire.com search that Bing is repeating results from page to page, since it only has one page.
 


You may also like

Tags: , , , , , , , , , ,
Posted in internet, media, publishing, technology, USA | No Comments »


This surely makes it blatantly obvious that Bing is near death

06.01.2023

Here’s a site I’ve always liked: Techdirt. It’s incredibly influential, and reports on the technology sector. Mike Masnick’s run it for the same length of time as I’ve run Lucire (25 years, and counting).

And when it comes to Bing’s index collapse—or whatever you wish to call it—it’s no more pronounced than here (well, at least among the sites that even get listed). For site:techdirt.com:
 
Google: 54,700 results, 393 visible
Mojeek: 48,818 results, 1,000 visible
Yandex: 2,000 results, 250 visible
Gigablast: 200 results, 200 visible
Yep: 10 results, 10 visible
Baidu: 1 result, 1 visible
Bing: 1 result, 1 visible
 



 

One. This is a site that dates back to the 1990s and churns out numerous articles daily, and that’s how bad Bing’s got. Naturally, it’s the same with all the Bing clones, like Yahoo (the one with no logo now), Ecosia, Qwant, Neeva, Duck Duck Go, etc. Unlike Baidu, Bing doesn’t have communist Chinese censorship as an excuse. Or does it?

If you ever needed proof something was really, really off at Redmond, this is it. And still the clones stay silent.
 





 
PS.: If you search for Techdirt on Bing, its home page does not even come up in the top 10.
 
P.PS.: Here’s what WorldWideWebSize.com has to report (thanks to nf3xn for posting it first on their Mastodon). I believe the site is wrong when it calculates that the total index was up as high as it is on the left of the graph: basically it takes what Bing claims is the number of results as the truth, and we know it lies.
 


You may also like

Tags: , , , , , , , , , , , , , , , ,
Posted in internet, media, publishing, technology, USA | No Comments »


On the verge of a change for the better

13.11.2022

I can’t find the original toot on Mastodon but I was led to this piece in the MIT Technology Review by Chris Stokel-Walker, ‘Here’s how a Twitter engineer says it will break in the coming weeks’.

As I’ve cut back on my Twitter usage, I haven’t witnessed any issues, but it does highlight the efforts Big Tech goes to in order to maintain their sites. If anything, it explains why Facebook failed so regularly and so often, as documented on this blog.

The prediction? An anonymous engineer tells the Review:

“Things will be broken. Things will be broken more often. Things will be broken for longer periods of time. Things will be broken in more severe ways,” he says. “Everything will compound until, eventually, it’s not usable.”

Twitter’s collapse into an unusable wreck is some time off, the engineer says, but the telltale signs of process rot are already there. It starts with the small things: “Bugs in whatever part of whatever client they’re using; whatever service in the back end they’re trying to use. They’ll be small annoyances to start, but as the back-end fixes are being delayed, things will accumulate until people will eventually just give up.”

I wonder if they will give up, since I’ve encountered Facebook bugs almost since the day I joined, and there are still people there. In fact, like tech experts, some fellow users even blame me, saying that I encounter more bugs than anyone they know. I doubt this: I just remember the bugs better than they do. We’ve all been subject to the well publicized global outages—just that the majority don’t remember them.

While one contact of mine disagrees, I think Twitter won’t collapse on its own. Mastodon could be an alternative, encouraging people away, just as Google enticed Altavista users over; or Facebook saw to the end of Myspace. There seems to be a new era coming, sweeping away the old, especially as Big Tech falters. Twitter has lost a huge chunk of its staff, and Facebook has slashed its ranks by 11,000. Mojeek has emerged as a credible, privacy-respecting alternative to Google—as Microsoft Bing collapses, taking with it its proxies, Duck Duck Go, Ecosia, Yahoo! and others. The web’s future feels more open, more optimistic, with these technologies spurring civilized dialogue and sparking ideas. It could almost be time to bring back the day-glo on a Wired cover.

On the other hand, maybe Twitter can collapse on its own, with a fake blue-tick EIi LiIIy, looking to the world like Eli Lilly, announcing free insulin and sending Eli Lilly’s share price tumbling, wiping milliards off its value. With advertisers pulling out (little wonder if their Twitter account managers are fired) it may look very different come Christmas.


You may also like

Tags: , , , , , , , , , ,
Posted in business, internet, technology, USA | No Comments »


Forget the 2010s and 2020s, Bing’s results are firmly in the 2000s now

09.10.2022

Immediately after blogging about Bing being able to pick up an article from 2022, Microsoft’s collapsing search engine has reverted back to being the Wayback Machine. There was just over a week of it living in the 2020s, but it seems it’s too much for them.

It’s back to, well, Bing Vista, for want of a better term. Of the 50 results (out of a claimed 120!) that it’s capable of returning for site:lucire.com, here is how it breaks down based on the publication year of the article. Since my last test, Bing has eliminated the 2018 and 2019 results (one page per year). We wouldn’t want to think it could deliver anything from the last decade, would we?
 
Bing
Contents’ pages ★★
1997
1998
1999
2000
2001 ★★★★★
2002
2003 ★★★★
2004 ★★★★
2005 ★★
2006 ★
2007 ★★★★★★★
2008 ★★
2009 ★★
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
 

There were 29 unique results, which means 21 were repeats—42 per cent! Bing says it had 120 results but really only had 29. To fill up the 50 it had to show 21 results multiple times!

Let’s see how Google fared for the first 50 results.
 
Google
Contents’ pages ★★★★★★★★★★
1997
1998
1999
2000
2001
2002 ★★
2003
2004 ★★
2005 ★
2006
2007 ★
2008
2009 ★
2010 ★★
2011 ★★★
2012 ★★
2013 ★★
2014 ★★★
2015 ★★
2016 ★★
2017 ★★
2018 ★★
2019 ★★★
2020 ★★★
2021 ★★
2022 ★★★★★
 
Google has moved again since we began looking at things. In an earlier test tonight, Google had two repeat results, which was a surprise. But I wasn’t able to replicate it when I did the one for the blog post.

No such issues at Mojeek, where every entry is unique. They really are more capable of delivering search engine results for site searches that are superior to the other two’s.
 
Mojeek
Contents’ pages ★★★★★★★★
1997
1998
1999
2000
2001
2002
2003
2004 ★
2005
2006
2007
2008
2009 ★
2010 ★★
2011 ★★
2012 ★★★
2013 ★★★★★
2014 ★★★
2015 ★★★★★
2016
2017
2018
2019 ★★★★
2020 ★★
2021 ★★★★★★★★★
2022 ★★★★★★
 
An improvement on our September 21 test, where Mojeek has managed to capture more 2020s pages as part of its top 50.

I won’t run the other search engines through this—I just wanted two points of comparison to highlight how ridiculous Bing remains, with the resultant effect on web traffic. It means Duck Duck Go, Qwant, Ecosia, Yahoo and others, which are also Bing, are just as compromised.

I might lay off them for a while as we know it’s crap and things aren’t going to change. Microsoft has firmly entrenched itself as a bunch of liars, like their other Big Tech counterparts.


You may also like

Tags: , , , , , , , , , ,
Posted in internet, technology, USA | No Comments »


Bing actually indexes a page from 2022—Microsoft must be in shock

08.10.2022

This is a miracle. Bing actually indexed and showed something from Lucire from 2022. Of course, since it’s incapable of remembering what it had shown in its results earlier, the story was repeated twice on subsequent pages.

Since I began my tests earlier this year (and finding out yet another area that tech companies brag about is actually half-baked and filled with BS), this is the first story from 2022 that Bing has picked up. Who knew, Microsoft’s much-talked-about search engine actually getting to something from after 2007?
 



You may also like

Tags: , , , , , ,
Posted in internet, technology, USA | No Comments »


Testing the search engines: Bing likes antiquity; most favour HTML over PHP

21.09.2022

Bing is spidering new pages, as long as they’re very, very old.

Last week, we added a handful of Lucire pages from 1998 and 1999. An explanation is given here. And I’ve spotted at least two of those among Bing’s results when I do a site:lucire.com search.

As a couple of newer pages have also shown up, I doubt there’s any issue with the template; and the home page now also appears, too. But, by and large, Bing is Microsoft’s own Wayback Machine, and most of the Lucire results are from the 1990s and early 2000s.

It got me thinking: do the other search engines do this, too? For years, Google grandfathered older pages and they came up earlier. (Meanwhile, searches for my own name still have this site, and the company site, down, having lost first and second when we switched from HTTP to HTTPS in March. Contrary to expert opinion, you don’t recover, at least not quickly.)

As Lucire includes the date of the article in the URL, this should be an easy investigation. We’ll only do the first 50 results as that’s all Bing’s capable of. I’ll try not to include any repeat results out of fairness. ‘Contents’ pages’ include the home page, the Lucire TV and Lucire print shopping pages, and tag and category pages.
 
Bing
Contents’ pages ★★★
1997
1998
1999 ★★★★
2000 ★
2001 ★★★★★★★★
2002 ★★
2003 ★★★
2004 ★★★★
2005 ★★
2006
2007 ★★★
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018 ★
2019 ★
2020
2021
2022
 
Google
Contents’ pages ★★★★★★★★★★★★★
1997
1998
1999
2000
2001
2002 ★★
2003
2004 ★★
2005
2006
2007 ★
2008
2009
2010 ★
2011 ★★★
2012 ★
2013 ★★
2014 ★★★
2015 ★
2016 ★★
2017 ★
2018 ★★★
2019 ★★★
2020 ★★★★★★★
2021 ★
2022 ★★★★
 
Mojeek
Contents’ pages ★★★★★★
1997
1998
1999
2000
2001
2002
2003
2004 ★
2005
2006
2007
2008
2009 ★
2010 ★★
2011 ★★
2012 ★★★
2013 ★★★★
2014 ★★★
2015 ★★★★★
2016 ★★★★★★★
2017 ★★★★★★
2018 ★★★
2019 ★★★★
2020 ★★★
2021
2022
 
Baidu
Contents’ pages ★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★
1997
1998
1999
2000
2001
2002
2003
2004
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018 ★
2019 ★
2020
2021 ★★★
2022 ★
 
Yandex
Contents’ pages ★★★★★
1997
1998
1999 ★★★★★
2000 ★★★★★★
2001 ★★★
2002 ★★★
2003 ★★★
2004 ★
2005
2006
2007 ★★★★
2008 ★★
2009 ★★
2010 ★★★★
2011 ★★★
2012 ★★
2013 ★
2014 ★★
2015
2016
2017
2018
2019
2020 ★★★
2021 ★
2022
 

To me, that was fascinating. My instincts weren’t wrong with Bing: it’s old and it favours the old (two of the restored articles were indexed). From the first 50 results, 18 results were repeats—that’s 36 per cent. I’m of the mind that Bing is so shot that it can only index old pages that don’t take up much space. New ones have a lot more data to them, generally.

Google does a good job with the top-level and second-level contents’ pages, though there were a few strange tag indices. But the distribution is what you’d expect: people would search for more recent stories. I know we had some popular stories from 2002 that still get hit a lot.

Mojeek has a similar distribution, though it should be noted that you can’t do a blanket site: search. There must be a keyword, and in this case it’s Lucire. The 2016 pages form the mode, which I don’t have a huge problem with; it’s better than the 2001 pages, which Bing has over everything else.

Baidu’s one is crazy as individual stories are seldom spat out in the first five pages, the search engine preferring tag indices, though half a dozen later story pages do make it into its top 50.

Finally, Yandex leans toward older pages, too, including our most popular 2002 piece. It’s the 2000 stories it has the most of among the top 50, and there’s a strange empty period between 2015 and 2019. But at least there is a fairer distribution than Bing can muster.

The other query that I had was whether these search engines were biasing their results toward HTML pages, rather than PHP ones. If that’s the case, then it could explain Bing’s preference for the old stuff (Lucire didn’t have PHP pages till 2008; prior to that it was all laboriously hand-coded, albeit within templates.)
 
Bing
★★★★★★★★★★★★★★★★★★★★★★★★★ HTML
★ PHP
 
Google
★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★ HTML
★★★★★★★★★ PHP
 
Mojeek
★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★ HTML
★★★★★★★★★★★★★★★★★ PHP
 
Baidu
★★★★★★★★★★ HTML
★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★ PHP
 
Yandex
★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★★ HTML
★★★★★★ PHP
 

I think we can safely say there’s a preference for HTML over PHP. Mojeek brings up a lot of HTML pages after the top 50, even though this sample shows the split isn’t as severe.

Our PHP pages are less significant though: they contain news stories, and these are often ones other media covered, too. But I would have thought some of the more popular stories would have made the cut, and here it’s Mojeek’s distribution that looks superior to the others’. It seems like it’s actually analysing the page content’s text, which is what you want a search engine to do.

Baidu’s PHP-heaviness is down to all the tag indices—rendering it not particularly helpful as a search engine.

On these two tests, Mojeek and Google rank best, and Yandex comes in third. Baidu and Bing are a distant fourth and fifth.


You may also like

Tags: , , , , , , , , , , ,
Posted in China, culture, internet, media, publishing, technology, UK, USA | No Comments »


Bing hates novelty—it’s really Microsoft’s Wayback Machine

27.08.2022

Bing is still very clearly near death, as this latest site: search shows.
 

 

It manages a grand total of 10 pages from Lucire, and as outlined before, some are pages that have not been linked to for 17 years.

I purposely updated some of the pages Bing had in its limited capacity, and strangely, those have disappeared! Bing doesn’t want anything new, as it appears to be Microsoft’s Wayback Machine.

The fifth result here is a case in point. Some of you may recall lucire.com/about.shtml appearing in all the search engines, including Bing. This is a page last updated in 2004, with some final tweaks in 2012 (I assume for ad code; I don’t recall). It was a page that I decided I would stick on to a new template, since the search engines loved it so much. I copied the text from our licensing site. And, for the sake of online archæology, I put the 2004 page exactly as it was into a file called about-2004.shtml.

Bing must still be alive enough to spider and index the renamed page, but it rejects the revised about.shtml!

It’s similar to what I wrote in mid-August when I updated other ancient pages from the early 2000s: Bing rejected them, including a frameset that now pointed at the latest page!

You may be thinking: obviously, you are doing something wrong with your newer code, Jack, for Bing to favour the old stuff. But look at the fourth result: it’s from 2020, the one “new” page that Bing has managed to index and show. I don’t think we have anything wrong with our code if this page has made it in.

Google happily included the new about.shtml.

A search for Lucire itself on Bing now does include the home page, which is a new development in a search engine that’s limping along. So much for the earlier claim that there were issues with the page that prevented it from appearing.


You may also like

Tags: , , , , , , , , , , , ,
Posted in internet, media, publishing, technology, USA | No Comments »


Testing the seven search engines in the world

22.08.2022

After reading Mojeek’s blog post from last July, I learned there are only seven search engines in the world now. In other words, I was checking more search engines out in the 1990s. It’s rather depressing, especially as the search market is largely a monopoly with Google dominating it (and all the ills that brings), and Bing and its licensees (like Duck Duck Go) with their 6 per cent.

Knowing there are seven, I fed the site:lucire.com search into all of them to see where each stood.

The first figure is the claimed number of results, the second the actual number shown (without repeats removed, which Bing is guilty of).

I can’t use Brave here as its site search is Bing as well.

Yandex appears to be capped at 250 and Mojeek at 1,000, but at least they aren’t arbitrary like Google and Baidu. Baidu has a lot of category and tag pages from the Wordpress section of our site to bump up the numbers.
 
Gigablast 0/0
Sogou 19/13
Bing 243/50
Baidu 13,700/213
Yandex 2,000/250
Google 6,280/315
Mojeek 3,654/1,000
 

Frankly, more of us should go to Mojeek. It can only get better with a wider user base. Unlike Bing, it hasn’t collapsed. I know most of you will keep going to Google, but I just don’t like the look of those limits (not to mention the massive privacy issues).

Mojeek is now at 5,900 million pages, which must be the largest index in the west outside of Google.


You may also like

Tags: , , , , , , , , , , , , , , , , ,
Posted in China, internet, publishing, technology, UK, USA | No Comments »


More of Bing’s follies (they just keep coming)

16.08.2022

I see WorldWideWebSize.com has wised up and figured out Bing was having them on about the number of results it had for their search terms.
 

 

When Bing says it has 300-odd results for the site:lucire.com yet doesn’t actually go beyond a limit of around 50 (where it has been stuck for many months), I was actually being generous. I never deducted the repeated results on the pages that it did show.

Here’s a case in point: an ego search for my own name. These are the first four pages. I realize I have the graphics a bit small, but you should be able to make out just how many pages have been repeated here. A regular search engine like Mojeek and Google show you different results on each page. Bing doesn’t.
 




 

More strange happenings: you’ll recall I noted that pages we haven’t linked to since the 2000s were up top in a site search on Bing for lucire.com. The very top one was lp.html, a frameset (yes, it’s that old). I did what I thought would be logical in such a circumstance: I pointed one of the frames to the current 2022 page (which is still regular HTML, but with Bootstrap).

Result in Bing: it’s vanished.

Did the same to news.html, not linked to since 2012.

Vanished.
 

 

The current news page is Wordpress, but Bing still manages to index the occasional Wordpress page on our site. The fact it’s PHP shouldn’t make a difference.

These pages are just too new for Bing, which is really Microsoft’s own Wayback Machine. And Duck Duck Go’s, and Qwant’s, and a whole manner of search engines’.
 
Meanwhile at Brave: it does have an independent spider but admits to using the Bing API for the image search, as does Mojeek. But what Brave doesn’t say is that it also taps in to Bing for site: searches, rendering them largely useless, too. Brave does a far better job than Bing in its regular search though, picking up lucire.com for Lucire as well as some major index pages.
 

On a regular search, Brave does rather well—it’s picked up the top pages.
 


Bing and Brave compared, using site:lucire.com. Brave isn’t as independent as you might think with site: and image searches. These screenshots were taken on Sunday.
 

Still well short of Mojeek in terms of its index—but then so is everyone aside from Google.

The saga continues, with still no one talking about Bing’s collapse (though I know of one journalist working away behind the scenes).


You may also like

Tags: , , , , , , , , , ,
Posted in branding, business, internet, technology, USA | No Comments »


Mystery sitemap files in Bing

12.08.2022

I only signed up to Bing Webmaster Tools when investigating why the company site did so poorly in Bing and Duck Duck Go—we now know it was nothing to do with us, and everything to do with a search engine basically disintegrating before our very eyes.

This, too, was interesting, from a screenshot dated July 20, 2022. I never added these sitemaps, and they all pre-date when I signed up to Webmaster Tools. They were all there when I went to the tools for lucire.com. They are not RSS feeds we’ve ever sanctioned, though of course someone could have created them intentionally to follow a subject. Maybe someone at Microsoft?
 

 

You may notice the number of pages: 51. These 51, however, have no real bearing on the 50-odd that Bing can display before it craps out.

I’ve since added sitemaps for the rest of the site, to no avail, natch.

Anyone else find weird sitemap files in their account after signing up?


You may also like

Tags: , , , ,
Posted in internet, technology | No Comments »