I am new here and would like to know how to connect to a Hadoop cluster from Yellowfin (on AWS).
Can someone guide me?
Thanks in advance.
I hope all is well,
As this is a third-party connection (external to Yellowfin), we can't fully support it. I can, however, point you to some documentation that should help you connect to Hadoop. Please see the links below. I hope this helps, Anji.
Link 1 | Link 2
Thanks, Mark, for the reply. However, I am not able to access the links. It says I have no permission to view them.
Apologies for this. Please see the information below:
Does Yellowfin support Hadoop and Hive configurations?
As of version 6.3, Yellowfin supports Hadoop and Hive.
What version of Hive does Yellowfin support?
We did our initial Yellowfin testing with Hive version 0.10. However, we do not expect any issues with newer Hive versions, provided you have placed all the required Hive files within a common directory and specified this directory when you create a Hive connection in Yellowfin. We have also completed testing with Intel's Hive distribution, which seems to be based on Hive 0.11.
Which .jar files are required for Apache Hive 0.13, and where do I have to place them? We are running Yellowfin 7.1.
The exact list of jar files changes according to the version of Hive being used, so the safest thing to do is a file search for hive-jdbc.jar. Once you find out which lib folder it is in, copy the whole folder into the Yellowfin installation and then point Yellowfin to that folder.
Not all of the jar files are needed, so if you have time once you have a successful connection using all of those jar files, you can remove them one at a time and check after each removal that the connection still works.
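The search-and-copy step above could be scripted roughly as follows. This is only a sketch, not a Yellowfin tool: the function names are made up for illustration, and it assumes the driver jar may carry a version suffix (for example hive-jdbc-0.13.0.jar), so it matches on hive-jdbc*.jar rather than the exact name.

```python
import shutil
from pathlib import Path
from typing import Optional


def find_hive_lib_dir(search_root: Path) -> Optional[Path]:
    """Return the first folder under search_root that holds a hive-jdbc*.jar.

    The glob pattern is an assumption: the thread names "hive-jdbc.jar",
    but the file may include a version suffix on your system.
    """
    for jar in sorted(search_root.rglob("hive-jdbc*.jar")):
        return jar.parent
    return None


def copy_hive_libs(search_root: Path, target_dir: Path) -> Path:
    """Copy the whole folder containing the Hive JDBC jar into target_dir.

    target_dir is a hypothetical folder inside the Yellowfin installation;
    it is the folder you would then point the Hive connection at.
    """
    lib_dir = find_hive_lib_dir(search_root)
    if lib_dir is None:
        raise FileNotFoundError(f"no hive-jdbc*.jar found under {search_root}")
    # dirs_exist_ok needs Python 3.8+; existing files are overwritten.
    shutil.copytree(lib_dir, target_dir, dirs_exist_ok=True)
    return target_dir
```

For example, copy_hive_libs(Path("/usr/lib/hive"), Path("/opt/yellowfin/hive-lib")) would copy the Hive lib folder into a new folder under the Yellowfin install. Both paths here are placeholders, so substitute the locations on your own machines.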
Can Yellowfin connect to Hadoop?
Below are some links on how Yellowfin can connect to Hadoop. You would connect through Hive.
Excerpt from the link above:
How does Yellowfin connect to, and extract data from, Hadoop?
“Yellowfin can take advantage of Hadoop’s awesome Big Data processing power by connecting to Hive.
“Hive is a database system that runs on-top-of Hadoop, providing an SQL-like language for tabular data selection and filtering.
“Hive can take advantage of the distributed nature of the Hadoop file system, allowing for petabytes of storage, using Map-Reduce functions to implement SQL-like queries.
“The Yellowfin interface allows users to quickly author reports and visualizations on Big Data, just like any other relational data source.
“When support for a new database management system is added to Yellowfin, an interface is developed describing the capabilities of the database. This allows Yellowfin to hide functionality that may not be available on a particular system. However, the SQL supported by Hive allows the availability of all functions.
“The implementation of a database interface for Hive in Yellowfin means that connection can be made using the connection wizard, prompting for host and port details, rather than entering a straight JDBC url.”
- See more at: http://www.yellowfinbi.com/YFCommunityNews-Addressing-Big-Data-analytics-with-Hadoop-Q-A-with-chief-Yellowfin-architect-Pet-116595#sthash.dnwTLpYO.dpuf
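For reference, the "straight JDBC url" that the wizard saves you from typing generally looks like the output of the sketch below. The helper name and the example host are made up. The jdbc:hive2:// scheme (HiveServer2), the older jdbc:hive:// scheme (original HiveServer) and the default port 10000 are standard Hive conventions, but which form Yellowfin builds from the wizard fields is not stated in this thread, so treat this as an illustration only.

```python
def hive_jdbc_url(host: str, port: int = 10000, database: str = "default",
                  hiveserver2: bool = True) -> str:
    """Build a Hive JDBC URL from the same details the wizard asks for.

    Hypothetical helper: it only formats a string and does not connect.
    """
    # HiveServer2 uses the "hive2" scheme; the original HiveServer used "hive".
    scheme = "hive2" if hiveserver2 else "hive"
    return f"jdbc:{scheme}://{host}:{port}/{database}"


# Example host is a placeholder for your Hadoop/Hive server on AWS.
print(hive_jdbc_url("hive.example.com"))
```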
There are also some YouTube videos on our Yellowfin channel:
I hope this helps, Anji.
I just want to touch base to see if the information was helpful and whether or not you would mind if I went ahead and marked this Question as Answered. I look forward to your reply.
I am going to go ahead and mark this Question as Answered. If you need any further information on this subject, please do not hesitate to contact us.