|By Avi Rosenthal||
|October 30, 2011 12:51 PM EDT||
Searching the Web
Search Engine is Google's most strategic product. Some of its other products are using it as component of the services they provide.
Limitation or failure of it, as described in Why we desperately need a New (and Better) Google cited in part1 of this post, could negatively affect Google's market position.
This is a major challenge facing Google. Google has no control of some of the factors limiting its effectiveness.
Challenges in Searching the Web
The following bullets describe major Web Data problems, which could affect Web Search:
- The data is growing exponentially
The amount of data is growing fast. A Search Engine should handle larger amounts of pages and probably will find more and more number of pages in each search operation results list.
Implications: Users will not read all entries in search results list. They will focus in the first entries. Search Engines results order should be accurate so users could identify the most relevant and important pages.
- No data Clean up Procedures
The common denominator of all of them is the necessity to manage them and to move to garbage items which are not in use and probably will not be used in the future. For example, you have to avoid of Virtualization Sprawl by deleting unused Virtual Machines in a Cloud, you have to delete temporary files in a PC or a Server. Spam, old or not important e-mail messages are deleted. Other e-mail messages are archived. Userids and their related information are deleted or at list should be deleted when the employee or client stops working.
Deletion and other Management Operations are done automatically or manually, but in all cases there is an owner or an administrator, who devises and performs a policy.
No one manage the Web. No one cleanup web pages except the owner of the page, who usually has no motivation for deleting web pages.
Implications: The result of high growth rate coupled with no management is too many Web pages and too many irrelevant pages.
- Data Reliability
Some pages are reliable and others are not. It is not an easy task to identify the reliable and valuable pages. For information read a previous post:
Search Engines should be able to assess the reliability of Web pages. It is not an easy task.
Implications: Reliable Web Pages should appear in the top of Search Results; otherwise users will read mostly unreliable sources.
- Multi data types
The data is not text only. It includes Videos, Pictures and Voice etc.
Non-text data size is a lot larger than text data size.
Implications: Searching non-text pages is not as easy as searching text. The Search Engine has to support new search types e.g. search of Images or Videos or even multi-type search e.g. text or images sharing a common Keyword.
The Web expansion or Entropy or exponential data growth is a major challenge for applications manipulating Web Data including Search Engines.
Semantic Web, Web 3.0, Big Data are attempts to address or minimize the effects of this problem.
main challenge is rating the Reliability and Value of Web Pages. Page Rank is an example of an algorithm for addressing this challenge.
Unique Google's Challenges
In addition to the need to cope with these general challenges Google has to cope with unique challenges, due to its position as the Search market leader with approximately 80% market share.
The challenges are not as simple as using an automated or human method of multiple clicking on advertisement. They are derived from attempts to fool Google Search Engine algorithms in order to move a Web page to the beginning of search results.
The following bullets provides three frequent mechanisms for fooling Google Search Engine or at least attempting to fool it:
1. Payment for an automated service which supposed to place a Web page in one of the first entries of Google's Search results list.
The paid service will access the Web page artificially many times as possible, preferably from highly ranked Web sites. This method could improve the Page Rank rating and place it in higher position than it should be positioned in Google searches.
2. Adding unrelated popular labels to a Web Page labels list.
This technique could show a Web Page in Search operations unrelated to its content. It may also position it higher than it should in Search results list
3. Cutting and Pacing full Web Pages or parts of Web Pages from other Web sites
For example, by copying a Wikipedia article content.
The page design may look perfect and the content could be reliable, but it attracts readers to a Web Site in which other pages are not reliable and are not well designed. Google's Search algorithm could rate the page higher than it should.
Google's Action: The Company announced recently that it will pay more for clicking advertisements in sites having original content.
Google's survival depends mainly upon two related domains: Web Search and its effective advertisement based business model.
The Web Search Engine is not as good as it was few years ago, due to Web Data exponential growth and fundamental changes of data characteristics. In addition to these factors, due to Google's dominance of this market users develop various mechanisms for fooling Google Search Engine.
Google challenge is to adapt and evolve its Search Engine. Google may need to find new creative ways to evolve its search Engine because nobody has Web Data Control, so the data quantity and characteristics could change significantly in the future.
As far as the Business model is concerned, it will be very difficult to replace it by entirely different model. However, the probability that an advertisement based business model will not be viable is low.
If the advertisement based business model will continue to be a viable model, Google's position will depend upon using it effectively in Search Page in conjunction with creating additional income sources from advertisement related to new services and Business Lines such as Android and You Tube TV.
The age of Digital Disruption is evolving into the next era – Digital Cohesion, an age in which applications securely self-assemble and deliver predictive services that continuously adapt to user behavior. Information from devices, sensors and applications around us will drive services seamlessly across mobile and fixed devices/infrastructure. This evolution is happening now in software defined services and secure networking. Four key drivers – Performance, Economics, Interoperability and Trust ...
May. 1, 2017 02:15 AM EDT Reads: 1,187
With billions of sensors deployed worldwide, the amount of machine-generated data will soon exceed what our networks can handle. But consumers and businesses will expect seamless experiences and real-time responsiveness. What does this mean for IoT devices and the infrastructure that supports them? More of the data will need to be handled at - or closer to - the devices themselves.
May. 1, 2017 12:30 AM EDT Reads: 1,288
SYS-CON Events announced today that CollabNet, a global leader in enterprise software development, release automation and DevOps solutions, will be a Bronze Sponsor of SYS-CON's 20th International Cloud Expo®, taking place from June 6-8, 2017, at the Javits Center in New York City, NY. CollabNet offers a broad range of solutions with the mission of helping modern organizations deliver quality software at speed. The company’s latest innovation, the DevOps Lifecycle Manager (DLM), supports Value S...
May. 1, 2017 12:15 AM EDT Reads: 1,553
Multiple data types are pouring into IoT deployments. Data is coming in small packages as well as enormous files and data streams of many sizes. Widespread use of mobile devices adds to the total. In this power panel at @ThingsExpo, moderated by Conference Chair Roger Strukhoff, panelists will look at the tools and environments that are being put to use in IoT deployments, as well as the team skills a modern enterprise IT shop needs to keep things running, get a handle on all this data, and deli...
Apr. 30, 2017 11:45 PM EDT Reads: 2,864
In his keynote at @ThingsExpo, Chris Matthieu, Director of IoT Engineering at Citrix and co-founder and CTO of Octoblu, focused on building an IoT platform and company. He provided a behind-the-scenes look at Octoblu’s platform, business, and pivots along the way (including the Citrix acquisition of Octoblu).
Apr. 30, 2017 11:00 PM EDT Reads: 1,943
The Internet of Things is clearly many things: data collection and analytics, wearables, Smart Grids and Smart Cities, the Industrial Internet, and more. Cool platforms like Arduino, Raspberry Pi, Intel's Galileo and Edison, and a diverse world of sensors are making the IoT a great toy box for developers in all these areas. In this Power Panel at @ThingsExpo, moderated by Conference Chair Roger Strukhoff, panelists discussed what things are the most important, which will have the most profound e...
Apr. 30, 2017 10:45 PM EDT Reads: 2,631
Grape Up is a software company, specialized in cloud native application development and professional services related to Cloud Foundry PaaS. With five expert teams that operate in various sectors of the market across the USA and Europe, we work with a variety of customers from emerging startups to Fortune 1000 companies.
Apr. 30, 2017 10:00 PM EDT Reads: 2,687
SYS-CON Events announced today that Grape Up will exhibit at SYS-CON's 21st International Cloud Expo®, which will take place on Oct. 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA. Grape Up is a software company specializing in cloud native application development and professional services related to Cloud Foundry PaaS. With five expert teams that operate in various sectors of the market across the U.S. and Europe, Grape Up works with a variety of customers from emergi...
Apr. 30, 2017 09:45 PM EDT Reads: 2,530
Financial Technology has become a topic of intense interest throughout the cloud developer and enterprise IT communities. Accordingly, attendees at the upcoming 20th Cloud Expo at the Javits Center in New York, June 6-8, 2017, will find fresh new content in a new track called FinTech.
Apr. 30, 2017 09:45 PM EDT Reads: 2,726
@ThingsExpo has been named the Most Influential ‘Smart Cities - IIoT' Account and @BigDataExpo has been named fourteenth by Right Relevance (RR), which provides curated information and intelligence on approximately 50,000 topics. In addition, Right Relevance provides an Insights offering that combines the above Topics and Influencers information with real time conversations to provide actionable intelligence with visualizations to enable decision making. The Insights service is applicable to eve...
Apr. 30, 2017 09:30 PM EDT Reads: 3,221
SYS-CON Events announced today that Interoute, owner-operator of one of Europe's largest networks and a global cloud services platform, has been named “Bronze Sponsor” of SYS-CON's 20th Cloud Expo, which will take place on June 6-8, 2017 at the Javits Center in New York, New York. Interoute is the owner-operator of one of Europe's largest networks and a global cloud services platform which encompasses 12 data centers, 14 virtual data centers and 31 colocation centers, with connections to 195 add...
Apr. 30, 2017 09:15 PM EDT Reads: 2,358
DevOps is often described as a combination of technology and culture. Without both, DevOps isn't complete. However, applying the culture to outdated technology is a recipe for disaster; as response times grow and connections between teams are delayed by technology, the culture will die. A Nutanix Enterprise Cloud has many benefits that provide the needed base for a true DevOps paradigm.
Apr. 30, 2017 08:30 PM EDT Reads: 1,463
20th Cloud Expo, taking place June 6-8, 2017, at the Javits Center in New York City, NY, will feature technical sessions from a rock star conference faculty and the leading industry players in the world. Cloud computing is now being embraced by a majority of enterprises of all sizes. Yesterday's debate about public vs. private has transformed into the reality of hybrid cloud: a recent survey shows that 74% of enterprises have a hybrid cloud strategy.
Apr. 30, 2017 08:15 PM EDT Reads: 3,577
With major technology companies and startups seriously embracing IoT strategies, now is the perfect time to attend @ThingsExpo 2016 in New York. Learn what is going on, contribute to the discussions, and ensure that your enterprise is as "IoT-Ready" as it can be! Internet of @ThingsExpo, taking place June 6-8, 2017, at the Javits Center in New York City, New York, is co-located with 20th Cloud Expo and will feature technical sessions from a rock star conference faculty and the leading industry p...
Apr. 30, 2017 08:00 PM EDT Reads: 1,775
Today we can collect lots and lots of performance data. We build beautiful dashboards and even have fancy query languages to access and transform the data. Still performance data is a secret language only a couple of people understand. The more business becomes digital the more stakeholders are interested in this data including how it relates to business. Some of these people have never used a monitoring tool before. They have a question on their mind like “How is my application doing” but no id...
Apr. 30, 2017 07:45 PM EDT Reads: 7,482
With major technology companies and startups seriously embracing Cloud strategies, now is the perfect time to attend @CloudExpo | @ThingsExpo, June 6-8, 2017, at the Javits Center in New York City, NY and October 31 - November 2, 2017, Santa Clara Convention Center, CA. Learn what is going on, contribute to the discussions, and ensure that your enterprise is on the right path to Digital Transformation.
Apr. 30, 2017 07:30 PM EDT Reads: 1,803
@GonzalezCarmen has been ranked the Number One Influencer and @ThingsExpo has been named the Number One Brand in the “M2M 2016: Top 100 Influencers and Brands” by Analytic. Onalytica analyzed tweets over the last 6 months mentioning the keywords M2M OR “Machine to Machine.” They then identified the top 100 most influential brands and individuals leading the discussion on Twitter.
Apr. 30, 2017 07:15 PM EDT Reads: 1,645
Five years ago development was seen as a dead-end career, now it’s anything but – with an explosion in mobile and IoT initiatives increasing the demand for skilled engineers. But apart from having a ready supply of great coders, what constitutes true ‘DevOps Royalty’? It’ll be the ability to craft resilient architectures, supportability, security everywhere across the software lifecycle. In his keynote at @DevOpsSummit at 20th Cloud Expo, Jeffrey Scheaffer, GM and SVP, Continuous Delivery Busine...
Apr. 30, 2017 07:00 PM EDT Reads: 1,509
Most technology leaders, contemporary and from the hardware era, are reshaping their businesses to do software in the hope of capturing value in IoT. Although IoT is relatively new in the market, it has already gone through many promotional terms such as IoE, IoX, SDX, Edge/Fog, Mist Compute, etc. Ultimately, irrespective of the name, it is about deriving value from independent software assets participating in an ecosystem as one comprehensive solution.
Apr. 30, 2017 06:45 PM EDT Reads: 799
SYS-CON Events announced today that T-Mobile will exhibit at SYS-CON's 20th International Cloud Expo®, which will take place on June 6-8, 2017, at the Javits Center in New York City, NY. As America's Un-carrier, T-Mobile US, Inc., is redefining the way consumers and businesses buy wireless services through leading product and service innovation. The Company's advanced nationwide 4G LTE network delivers outstanding wireless experiences to 67.4 million customers who are unwilling to compromise on ...
Apr. 30, 2017 06:15 PM EDT Reads: 1,677